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International Journal of Dravidian Linguistics 


INTRODUCTION 


Franklin C. Southworth 
Mahadev L. Apte 


The South Asian! area has been, for at least several millenia, 
the homeland of a number of distinct ethnic and linguistic groups. 
Languages of at least four distinct stocks have coexisted here since 
prehistoric times; Indo-Aryan (a branch of Indo-European), 
Dravidian, Munda, and Tibeto-Burman. Attention was focussed on 
South Asia as a “linguistic area’’” by Murray Emeneau’s classic paper 
(Emeneau 1956), in which Emeneau plunged straight forwardly into 
the old Boas-Sapir controversy on the diffusion of linguistic traits 
across genetic boundaries. He pointed out that SA languages provide 
a rich source of data, with a sizeable time-depth, for the examination 
of the diffusion question. In subsequent papers (1969, 1971 b, and 
his contribution to this volume), he has explored the historical 
implications of particular linguistic traits which are shared by 
languages of different SA families, providing a wealth of detailed 
information on the diffusion of structural features in the area, and 
on the contexts in which this diffusion has occurred. His research in 
this field has led him to some very specific hypotheses about Indian 
prehistory, which will be discussed further in (6) below. This volume 
owes its general inspiration, as well as the specific stimulus for 
particular papers, to Emeneau’s pioneering work. 


(1) Convergence 


The papers in this volume are concerned with a phenomenon 
which has been noticed by many observers of the South Asian scene, 
including both laymen and scholars. Those who have learned one 
SA language, and then set out to learn another, have often been 
struck by the unexpected similarities they find. These similarities are 
even more startling when the two languages belong to different 
families. For example, to a student of Hindi-Urdu who has mastered 
word order, has struggled with the semantic complexities of the tense- 
aspect system and the compound verb, has learned to use the gerund 


th, Apte 
ijdl Vol. III No. 1 ib Southwor ip 


construction, the emphatic particles, and echo-formanane pS 
paanii-vaanii ‘water (paanii) and the like’, (rofii-votii) ‘bread He 
and the like’, it may come as a bit of a shock when learning Tamu a 
Telugu to find all these familiar phenomena in a thinly peek 
form-down to echo formations such as Tamil kaapi-kiippi coffee 
(kaappi) and the like’ (see SOUTHWORTH* for further detail on 
these and other features). 

These similarities began to attract the attention of western 
scholars, especially those interested in the study of Sanskrit, from a 
fairly early period. (See the references cited in EMENEAU and 
KUIPER, as well as Emeneau 1956, 1971b, Krishnamurti 1969, 
Southworth 1971a) The papers in this volume represent a 
continuation of the work of scholars of previous generations, and in 
some cases fill in considerable details about aspects of convergence 
which have been previously known or suspected, but not systematically 
reported. BENDIX and PANDIT report on specific cases of conver- 
gence involving languages of different families. MASICA describes 
area-wide syntactic features which set the SA area apart from 
neighbouring languages. APTE describes what appears to be an 
ongoing process, akin to pidginization, by which ‘‘Bombay Hindi- 
Urdu” has come to resemble Marathi in many features. DE SILVA 
and ULLRICH report on different aspects of convergence between 
varieties of the same language in a diglossia__ relationship. 
EMENEAU's paper can be regarded as the latest in a series of detailed 
examinations of features shared by Dravidian and Indo-Aryan, and 
their historical implications. KUIPER and SOUTHWORTH also 
deal with the history of Dravidian influence on Indo-Aryan. (The 


historical interpretation of convergence in South Asia, which is 
fraught with controversy, is discussed below.) 


What do we mean by ‘convergence’ in the South Asian 
context? Primarily, we are referring to the distinctive characteristic 
of a ‘linguistic area’, which Emeneau has defined as “an area Which 
includes languages belonging to more than one family but showing 
traits in common which are found not to belong to the other members 
of (at least) one of the families” (1956:16, note 28). In other words 
we refer to resemblances between languages which are the result of 
contact rather than common origin, such as numeral Classifiers in 
IA languages (Emeneau 1956), which Clearly originated in the eastern 
end of the sub-continent and are typically associated with languages 
of the Sino-Tibetan groups. Convergence also takes place beam 
related languages, though it jis less Striking and often taken fe 
granted by linguistic historians. Such convergence is no less wou 


* ee j Vora aan: 
Names of authors in capitals refer to the papers in this volume 
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of study than that between languages of different families, and we 
presume that the process involved is essentially the same. Thus, an 


expression like kuthe zaauun raahilaas ‘Where are you going ?’ 
in Nagpur Marathi (=Poona Marathi kuthe zZaatoes) appears to owe 
its resemblance to constructions like Hindi Urdu kah@@ jaa rahaa 
hai, in the same way and for the same reasons that Saurashtri 
constructions resemble Tamil (see PANDIT). 

As in other areas of the world, SA languages abound in cases 
of word borrowing. The principal sources have been the classical 
languages (Sanskrit, Arabic, Persian, Classical Tamil), the modern 
languages of colonial domination (English, Persian, Portuguese), and 
to some extent the modern SA vernaculars themselves. The borrowing 
of surface lexical forms (with or without change of meaning) is, 
however, quite distinct from the question of structural convergence. 
In fact, in some cases the shared structural traits have diffused in the 
opposite direction from the main current of word borrowing. Thus, 
the modern literary and elite forms of Dravidian languages such as 
Telugu, Kannada, and Malayalam have borrowed massively from 
Sanskrit, whereas much of the structural convergence has involved 
the adoption of non-IA traits by JA languages (see below). As a 
result some IA languages, such as Marathi (Southworth 1971 a), 
appear to have significant portions of their phonological, syntactic, 
and semantic structure from one source, while their lexical form 
(including basic morpheme shapes, allomorphic variants, and morpho- 
phonemic alternations) derive from another source. The ‘‘borrowed’’ 
structural features in such cases are generally expressed by ‘‘inherited”’ 
morphs, as in the case of Bengali jon ‘person’ (Skt. jana, cf. Latin 
genus etc.) used as a numeral classifier, or by the [A forms discussed 
in the first part of EMENEAU’s paper. 


MASICA notes similarities in word order throughout the 
SA area, and shows that these traits mark off the area from neigh- 
boring ones. Though some transformationalists, as he points out, 
have claimed that there are deeper syntactic processes which are 
not necessarily affected by this convergence, we question whether 
the criteria for identifying these deeper processes have been suffici- 
ently developed. If we do ever reach a stage where itis possible 
to identify deep processes which reflect genetic stock, to more 
superficial features which are affected by convergence, this will 
indeed be interesting. In order todo this reliably, however, we 
must find methods for identifying such features independently of 
the investigator's knowledge of genetic identification. In any case, 
given the limited possibilities of ordering, it is questionable whether 
features of order will be of much use for genetic classification. 
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¢ A area which hav 
Two cases from the SA a They. represent 


reported on have excited considerable attention. ce 
extreme forms of the developments under discussion here. os 
are (a) the case of a village in the Marathi-speaking area whic 
Gumperz & Wilson (1971) have called Kupwar, and (6) the case 
of Saurashtri described by PANDIT (see also Pandit 1972). In ie 
Kupwar case, Gumperz & Wilson report that the three SS , 
spoken in the village — Kannada (Dravidian), Marathi (IA), an 
Urdu (IA) — are substantially identical in phonology, syntax, and 
semantics, and differ only in morphophonemics (what we have 
called ‘lexical form’ above): 


For many Kupwar residents, especially men, a model of ling- 
uistic competence must comprise a single semological, a single 
syntactic, and a single phonetic component, and alternative 
sets of rules for the relation of semantic categories to 
morphemic shapes. (Gumperz & Wilson 1971: 165) 


In the case of Saurashtri, a form of Gujarati spoken by a group 
who immigrated to Tamilnadu some centuries ago, PANDIT finds 
that a segment of the grammar, the system of number names, is 
identical in both Saurashtri and Tamil, whereas. the lexical 
forms of Saurashtri numerals are mainly IA. In another 
study (Pandit 1972), he has discussed a number of other con- 
Structions and arrived at similar conclusions. Gumperz & Wilson 


make the following comment onthe social function of situations 
like that of Kupwar: 


This linguistic situation seems uniquely suited to the social 
Situation... While language distinctions maintained, actual 
Messages show word-for-word or morph-for-morph translat- 
ability, and speakers can therefore switch from one code to 
another witha minimum of additional learning (1971: 164-5) 


PANDIT also links the development of the Saurashtri situation to 
the need for maintenance of group identity through language, and 
the consequent need for constant Switching. He also points out 
that the form of Saurashtri he has described can be learned by a 
Tamil speaker or a Gujarati speaker in the Space of a few weeks.2 


To summarize, ‘convergence’ in the South Asian context 
refers to a tendency to carry over phonetic, grammatical and 
semantic features of one language into another While retainths the 
lexical form of the second language more or less intact ‘(We 
cannot say at this point, either in general or in particular cases 
how much of this carrying Over is done by native speakers of which 
language.) Under certain circumstances (see (5) below for aks 
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discussion), the underlying systems of the languages involved 
approach identity. 


(2) Types of convergence 

The term ‘convergence’ reflects a variety of processes, which 
can be differentiated along a number of different dimensions. The 
usefulness of this term as a general concept is still open to question. 
A brief look at some of the different ways of classifying convergence 
may help to clarify the subject. 


(a) relationships between the languages involved in convergence 

Here we can distinguish between internal and external con- 
vergence, the former involving varieties of the “same language’’, 
particularly those in a diglossia relationship. Such convergence 
can given rise to new forms which, once stabilized, start the trend 
toward convergence again (DE SILVA). On the other hand, the 
initial convergence between varieties of the same language can lead 
to morphological co-existence (ULLRICH), which may be an earlier 
stage of the convergence process. It may also be useful to 
distinguish between related and unrelated languages in the convergence 
process (compare APTE’s Bombay Hindi Urdu, involving related 
languages, with a similar phenomenon in Indian English discussed 
by SOUTHWORTH; cf. also Gumperz 1964). A further distin- 
ction is involved in the case of Indian English, namely that between 
domestic languages and imported languages; this is also relevant to 
the influence of Persian on Hindi-Urdu and Marathi. 


(b) Structural effects of convergence 

Hymes (1971 : 74-77) has presented a method of classifying 
structural effects of convergence. Hymes’ scheme depends ona 
division of a language into four components: phonetics, lexicon 
(what we have called ‘lexical form’ above, and what Gumperz & 
Wilson call ‘morphophonemics’), syntax, and semantics. Taking 
these components in all-or-nothing fashion, Hymes arrives at a 
hypothetical total of 14 types (over half of which are uninhabited), 
depending on which of the components is preserved and which 
replaced by convergence. The SA cases tend toward Hymes’ type 
13, in which lexicon is preserved and all else changes. This may 
be compared for example with type 4, exemplified by “European 
intertranslatability’, in which there tends to be convergence in 
semantics, with the other three elements remaining largely unaffected. 
The notion of taking each component as a whole is useful from a 
typological point of view, but needs of course to be modified in 
any attempt to deal with the reality of a particular situation, as 
noted in (c) and (d) below. 
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(c) Degree of convergence 


It is also necessary to recognize differences in the eee! of 
convergence, as illustrated by the variety of descriptions pss Me: 
the papers in this volume. For example, in contrast to sta eme 
like that of Gumperz & Wilson quoted below, BENDIX (section ) 
‘describes the Nepali-Newari situation in the following _terms : 
‘‘Nepali and Newari do not exhibit the spectacular relationship, 
reportedly found elsewhere in South Asia, of being virtually one 
code with two expression systems’, One might say of BENDIX S 
account, which focusses on similarities in verbal semantics, that 
it shows how close two languages can come without becoming 
identical. His discription of subject-verb concord (4.3), for example, 
shows Nepalito bein an intermediate position between. Newari 
(and other Tibeto-Burman languages) and IA languages like Hindi- 
Urdu. This feature of Nepali had earlier been described by South 
worth (1967: 14, note 10) as being essentially like that of Hindi- 
Urdu except for the ordering of concord rules. A question worth 
investigating is whether, from the transformational-generative point 
of view, convergence can be usefully described in terms of such 
changes in rule structure, and whether this may provide a possible 
means of quantifying the degree of change. 


BENDIX’s comment also raises the question whether the type 
of resemblance he has described, or the type of areal phenomena 
described by MASICA, may be simply an intermediate Step along the 
way to the complete convergence (the identity of all but the terminal 
symbols, in generative terms) described by PANDIT and by Gumperz 
& Wilson, This may be true, but the Situation is probably more 
complicated than that. There is clearly a great deal of convergence 
which historically antedates the Kupwar and Saurashtri Situations. 
Thus, many cases of the type reported for these Situations, e. g. 
morph-by-morph translateable sentences, can also be found between 
non-contiguous and unrelated languages such as Marathi and Tamil 
(see SOUTHWORTH, section (3), for an example), On the other 
hand, many cases which appear at first glance to be identical are not 
completely so. For example, Southworth (1971) has pointed out 
that Marathi uses two inherited IA morphemes to express a distinction 
apparently borrowed from Dravidian, namely that between inclusive 
and exclusive first-person plural pronouns: aamhi 
‘we-exclusive’, aapan (cf. Skt. aaiman) ‘we-inclusive’. 
second form, aapan, also has other meanings (‘sel f*: ‘you’ (formal) - 
cf. Hindi-Urdu aap) which are not traceable to Dravidian inf 
Clearly, then, more careful investigation of hiss: Cae 


But the 


many of these cases is 


(cf. Skt, asmaakain)- 


¥ ” , i aa aT a es 
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needed before we can speak of establishing -a linear continuum of 

degrees of convergence. It will also be necessat y to examine more 

cases in detail in order to establish the extent to which semantic 

convergence, such as that described by BENDIX, is independent of 


certain types of syntactic convergence (e. g. similar order of morphs 
in verbal constructions), 


(d) Direction of convergence 


Any particular instance of convergence can usually be described 
in one of three ways: X converges toward (i.e. adopts a feature or a 
rule of) Y; Y converges toward X; or X and Y move together in a 
new direction. With regard to the treatment of any component of a 
language see (2b) above) asa whole, it may often be possible to 
State that the structure of one or the other languages is dominant, by 
examining the different changes that occur with respect to that 
component. Thus, PANDIT clearly finds Tamil syntactic structure 
to be dominant in the case of Saurashtri. On the other hand, APTE 
does not find the situation to be as clearcut in the case of Bombay 
Hindi-Urdu. Admittedly, the similarity between the “standard’’ 
forms of Marathi and Hindi-Urdu is very great to start with. 


(e) Convergence and internal variation 


Another point which needs discussion is the reiationship 
between dialectal variations within individual languages and deve- 
lopments due to convergence. For example, the change due to 
convergence is often contrasted with the existing state of the two 
languages involved in the contact situation. However, we need to 
fiad out if only the ‘standard’ variety of these languages is taken 
into consideration. In other words, if diglossic situations exist, 
then the change may have occurred with regard to the ‘high’ variety 
or varieties which are often over-differentiated. But what appears 
to be a change from such a comparison may not be change at all 
when compared with the ‘low’ varieties, since the ‘high’ and ‘low’ 
varieties within the same language may have considerable difference. 
To give an example, standard Marathi has the phonemic distinction 
between the affricates c/c and j/j in certain environments. Telugu, 
a contiguous language, has sucha distinction only at the allophonic 
level. Now if certain Marathi dialects show the change from 
phonemic to allophonic status or conversely, some Telugu dialects 
show achange from allophonic to phonemic status, we need to 
examine the dialects involved in the situation before arguing that 
such a change is due to contact and convergence, because it may be 
that although standard Marathi makes this distinction, it may not 
be part of the phonological system of that particular dialect of 
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Marathi involved in the contact situation. Reverse would be the 
case for Telugu. Ineither of these cases change due to contact has 
taken place only with reference to standard Marathi, or Telugu, 
but not with reference to the particular dialects involved. 


(3) The social context of linguistic convergence in South Asia 


Are there features of the SA social situation which can be 
considered responsible for the particular type of convergence which 
has occurred here ? First of all, the cases described in this volume 
involve considerable variety of conditions, In the case of Saurashtri, 
we are dealing with a relatively small, isolated group of immigrants 
with no great social prestige. Kupwar is a case of interaction on 
the village level, where the village elite speak Kannada at home, 
whereas Marathi is the language of prestige in the general area. The 
Situation described by BENDIX occurs in a context of centuries of 
contact in the urban area of Kathmandu and its surroundings, as 
well as earlier contact involving other languages than the two he 
describes. APTE describes a lingua franca which has grown up in 
commercial and urban settings, between two Closely related languages, 
and whicd apparently is not used by anyone as a home language.? 
The exisitence of such differences Suggests the need for an adequate 
terminology to deal with the Social background of convergence, It 
is as yet too early to attempt to develop such a terminology, or 
to do the needed work of investigating which social factors 
correlate with which kinds of convergence. A priori, factors like 
the relative sizes of gstoups, their relative prestige, the area of 
contact, the nature of interaction, relative isolation of a group 
from its home area, etc., are likely to be important. In addition, 
changing circumstances (migration, growing need of communication, 
changes in social Structure, the pragmatic need for bilingualism in 
border areas, etc.) will need to be taken into account. 


On the other hand, are there any features common to all the 
situations described in South Asia? There seems to be one 
principal one, which has been singled out by Gumperz & Wilson 
namely the maintenance of a sharp distinction between interaction 
within the home and Outside of it, or as they put it, the “ethnic 
separateness of home life’? (1971:153), They note that 
or values require strict Separation between public and 
kin group) spheres of activity”, and that, while there 


“local norms 
Private (intra- 
is considerable. 


» Nevertheless “Speech in the home, especially to 


women and children, js exclusively in the home. >» 
They conclude : ome-group language’, 
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As long as ethnic separatencss of home life 

is valued, and language remains associated 

with ethnic separateness, there is little reason 

to expect multilingualism to‘disappear. 

(Gumperz & Wilson 1971 : 153-4) 
Their comment on the function of convergent codes (quoted in 
section (1) above) implies further that, as long as multilingualism 
survives in this context, the socia] factors leading to linguistic con- 
vergence will be present. 


Taking ‘‘ethnic separateness of home life’? as the major 
variable which has led to the SA convergence phenomena leaves 
us, however, with several problems. For one thing, we need to know 
why this separateness is so persistent in South Asia as compared to 
other parts of the world. Also, why does ethnic separateness 
elsewhere not produce similar results? A look at those groups who 
have maintained linguistic separateness in various parts of South 
Asia leads to the suggestion that most of them are rather small 
groups who could be said to have some particular reason for 
remaining separate, either for reasons of prestige (as in the case of 
the Tamil-speaking Aiyar Brahmins of Kerala, or the Brahmins of 
Tamilnad with their highly sanskritized Tamil), or because ofa par- 
ticular occupational identification (e. g. artisans like goldsmiths or 
carpenters), or because of enforced isolation (as in the case of the 
Nayadis of Kerala, who were forced to observe ‘‘distance-pollution’’). 
Note that the Tulu-speaking Embrandiris of Kerala, many of 
whom have abandoned their original language and adopted 
Malayalam, have consistently emulated Nambudiri behavior in 
other respects, and some of them call themselves Nambudiris, though 
the two groups do not intermarry (Thurston 1909: 2:209). Of the 
non-Marathi-speaking groups described by Gumperz & Wilson in 
Kupwar, all can be said to come under this description, i. e. all 
are relatively small groups who have nothing to gain by abandoning 
their home language for Marathi. The Jain and Moslem land- 
owners, the prestigious groups in the village, have no reason to 
emulate Marathi-speaking Hindu elites in other villages. The 
Kannada-speaking artisans keep themselves separate from other 
non-landowning groups by their language, and the Telugu rope- 
makers may have that motivation as well as that of enforced 
isolation (traditional untouchability). Thus the statement about 
ethnic separateness of home life being valued seems to apply 
mainly to minority groups, and has no particular meaning for the 
majority - which in most South Asian villages consists of landless 
agricultural laborers. (See SOUTHWORTH, section (5), for 
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. . . ~ f 
discussion of the implications of this question for the loss o 
Dravidian languages in the IA area.) 


(4) Convergence and pidginization-creolization 

As we attempt to define the nature of convergence, we also 
need to discuss the related problem, namely that of the correlation 
between convergence on the one hand, and the processes of pidginiz- 
ation and creolization on the other. From one point of view, it 
can be argued that the concept of convergence isa broader one, 
subsuming both pidginization and creolization, among others. That 
is, there are a number of different processes which involve conver- 
gence, and which can be considered as diiferent kinds of convergence. 
Another way of looking at itis that of Hymes (1971 :83-5), who 
regards pidginization and creolization as ‘“‘complex sociolinguistic 
processes*’, which can be analyzed into various elementary processes 
(linguistic and/or social). Convergence is one of these processes 
(though as noted above, itis also analyzable in various ways). 


Hymes distinguishes between pidginization as a process 
(involving convergence, and reduction in the inner form of the langu- 
age, in the social context of restricted use) and a pidgin, which 
refers to the result of such a process when it has crystallized or 
‘achieved autonomy as a norm”. A similar distinction is made 
between the process of creo/ization (involving convergence and 
expansion in inner form, in the social context of extended use) 
and a creole, A key pointis that, while pidginization and creoliz- 
ation appear to be fairly common processes, they only rarely lead 
to the creation of autonomous pidgins or creoles. (This apparently 
requires rather special social circumstances, one of which is appar- 
ently the presence of at least three distinct linguistic groups.) 


Hymes, along with others, regards reduction or Simplification 
of structure as a major characteristic of pidgins. However, conver- 
gence May not necessarily lead to reduction of Structure, Jt is in 
this context that the distinction made by Samarin (1971) between 
Salient and substantive pidginization is relevant. While salient 
pidginization involves extensive borrowing, substantive pidginization 
involves simplification (Samarin 1971: 119). In the case ‘of Hindi 
spoken in Bombay (Apte) both Seem to occur, In the case of 
Saurashtri however, it is difficult to determine what | 
ootkeapie daha exactly happened. 

y case, the similarities ang differences betw idginizati 
res cen pidginization 

‘Avergence need to be brought out clearly. 


Reduction” and “simplification’’ also need to be dist 


ed carefully, to the extent this is possible. inguish- 


Generally Speaking the 
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distinction is clear. Reduction refers to a decrease in the number 
of semantically distinct alternatives available, ¢.g. the merger of 
first-person singular and plural pronouns which occurred in eastern 
IA languages. Simplification refers to a decrease in non-distinctive 
Surface variants, asin the case of baby-talk analogical formations 
like foots or I sleeped. What shall we say of cases like the loss of 
grammatical gender distinctions (see Gumperz & Wilson 1971: 156-7)? 
The linguist’s attitude toward such a change will presumably depend 
on how he decides to describe gender, whether it isacategory of 
surface structute or of deep structure. 


Convergence in the SA scene occurs accompanied by certain 
of the processes discussed by Hymes' In Bombay Hindi-Urdu there 
is acertain amount of morphological reduction, along with some 
autonomous developments, as indicated by APTE. SOUTHWORTH 
points to similar developments in uneducated Indian English. The 
development of Hindustani Urdu, as we know it in the modern 
scene, could almost be described as a partial relexification (replace- 
ment of vocabulary) of north-central NIA based on colloquial 
Persian, Its later extreme form, Urdu, can be considered creolized 
in the terms of Hymes’ definition. The same can be said of the 
literary forms of “‘Hindi’’ and the other NIA languages, which have 
begun to converge in recent decades asa result of massive infusions 
of Sanskrit vocabulary (partially relexification, partially expansion 
of semantic resources), and the adoption of Sanskrit morpho- 
phophonemic rules, compounding processes, etc. DE SILVA has 
given a detailed description of this process for Sinhalese. 


Southworth (1971) has noted the similarity between the 
changes which occurred in MIA (morphological reduction) and 
changes characteristic of pidgins. It is possible to regard the 
replacement of the old inflectional system by new means (auxiliary 
verbs, postpositions) as coming within Hymes’ definition of creoliza- 
tion, The evidence for general processes of simplification (e. g. the 
loss of copula found commonly in Marathi, and occasionally in 
Hindi-Urdu-cf. Ferguson 1971) rests on the same data, and unfor- 
tunately our knowledge of the earlier stages of the IA languages is 
not at present adequate to make a definitive statement. 


(5) Creolization without pidginization? 

The earliest evidence for convergence in South Asia is that pre- 
sented by KUIPER. These rather minimal changes probably occurred 
in a gradual, orderly way in a fairly stable social situation. There is 
no reason to suspect that the society in which these changes took place 
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was affected by the kinds of social upheaval or mass movements of 
people which are found in the typical pidgin-creole situation (cf. 
Alleyne’s paper in Hymes 1971). Thus it would seem necessary to 
recognize a process which we can all slow convergence in the SA 
context. It is entirely conceivable that this process alone could 
lead ultimately to the Kupwar and Saurashtri developments discussed 
above. As Gumperz & Wilson point out, there is no reason to 
suspect the existence of a pidgin stage at any point in the history of 
Kupwar. The same would appear to apply in the Saurashtri case. 
As Gumperz & Wilson have put it: 


The Kupwar varieties have processes of reduction and con- 
vergence suggestive of pidginization and creolization. To say 
that the varieties have in fact undergone these processes would 
of course be misleading, if creolization is definded as requiring 
a pidgin as its starting-point... The present state of the langu- 
age is creole-like, in that one finds grammatical structure and 
lexical shape pointing to different sources, quite like the stereotype 
of a pidgin or creole as the words of one language used with 
the grammar of another... The need for constant code-switching 
and for mutual adaptation within a situation in which home 
languages are maintained has led to reduction and adaptation in 
linguistic structure. 


(Gumperz & Wilson 1971: 165) 


SOUTHWORTH, viewing the long-range picture, arrives at a 
Somewhat different view. He suggests that the orderly slow conver- 
gence described by KUIPER was probably accompanied by reduced 
forms like that described by APTE. With the population move- 
ments found in the later prehistoric period (the first millenium B.C. 
and later), it is likely that some pidginization (extreme reduction 
with some degree of autonomous development) occurred along tare 
routes and in ethnically mixed settlements. We note that the 
history of urbanization in the Gangetic plains begins in this period 
The question is then whether slow convergence can (or does in fact) 
ever occur Without accompanying pidginization. There are two 
possible negative answers to this question. (1) It is possible that the 
conditions which produce convergence inevitably or usually produce 
some pidginization (though not necessarily the development of 
autonomous pidgins) as a by product, especially on the geographical 
and social periphery of the society. In extremely settled Bonin 
probably such pidginization would be kept to a minimum 2) Tee | 
eee vane that pidginization is a prerequisite for ae = 
Cast the most extreme forms of convergence, SOUTHWORTH 
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Suggests this when he postulates that the development of IA langu- 
ages resulted from a process of internal convergence between 
conservative and highly pidginized varieties spoken by different 
segments of the population. On the other hand, other cases of 
convergence (e.g. the semantic convergence labelled ‘“Ruropean 
intertranslatability’” by Hymes) do not seem to require such an 
assumption. Thus, in any case, it is necessary to specify the social 
conditions under which these things are likely to happen. This, as 
fas as we know, is a field which is wide open to research. 


An important question which has not, as far as we know, 


been explicitly addressed, is the role of contact in convergence, as 
opposed to the usual pidgin situation which seems to involve a brief 


initial period of language contact. When we find a common core 


of features in an area like South Asia, we would like to know 
what parts of it have resulted from independent evolution in each 
language and in each family, and how much of it is due to continuing 
contact. KUIPER and others have maintained, for example, that 
the retroflex-dental contrast in Sanskrit goes back to the Vedic 
period, and has resulted from contact with Dravidian languages. 
Since the frequency of retroflex consonants in IA languages increases 
with time, the present state of affairs would seem tobe a result 
both of the initial contact and the continuing contact between IA 
and Dravidian speakers. 


The possibility of a cyclic development must also be taken 
into account. Convergence brings two language varieties together, 
thus creating a common core. Over a period of time, the new 
varieties begin to show signs of independent development, which may 
later be overlaid by anew period of convergence. The first part of 
DE SILVA’s paper is intended primarily to ‘‘illustrate the generaliz- 
ability of the nature of convergence irrespective of the chronological 
distance involved in the defining situations’. Such developments 
occur not only in isolated cases, but in many situations involving 
various languages across geographic boundaries and language families, 
Southworth’s attempt with respect to Marathi (1971) illustrates 
the possibility of examining these processes at long range, though 
the questions he raises cannot be resolved without a clearer picture 
of the relationship between creolization and convergence. 


Future discussions of these phenomena will need a more 
developed terminology for dealing with these processes, plimarily 
because there are more different kinds of processes than can be 
handled with the terms available. Bendix (personal communication) 
has pointed out, for example, that the time relationships between 
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the various elementary processes discussed by Hymes are not usually 
specified. Thus, in the classical use of creolization as a post-pidgin 
process, it is normally assumed that convergence took place in the 
earlier (pidgin) stage, whereas Gumperz and Wilson use the term 
to describe a process which includes (simultaneous) convergence. 
Clearly, the issue goes beyond terminology. Accepting a particular 
set of definitions (such as that of Hymes) is perhaps a necessary 
Step, but the next step would be to go on describing the details of 
what has happened while we grope toward a new and more com- 
prehensive terminology. In the cases discussed here, we need more 
facts in order to know what has happened. We need to observe 
What happens in play-groups in Kupwar and in Saurashtri, how 
much pidginization occurs at different ages, etc. We need to know 
more of the social variation in Kupwar, whether more pidginized 
varieties exist at different social levels. If this volume serves to 
Stimulate a recognition of the need for this type of work, it wil] 
have served one of its main purposes. 


(6) The use of linguistic evidence in the study of prehistory 


The distribution of shared linguistic traits, including 
borrowed words, is one of the most important sources for the 
prehistory of South Asia. Emeneau has shown this in his various 
papers, which are important not only for the facts presented and 
the ingenious use of linguistic data, but also because he has 
established “‘linguistic archaeology’”’ as an independent auxiliary of 
history. On the linguistic side, he has focussed attention on the 
need to orient diachronic studies toward the linguistic area concept, 
and to get out of the narrow rut of “genetic” orientation (see section 
(8) below). Kuiper, who shares with Emeneau the distinction of 
being among the few scholars to bridge the linguistic gap between 
Dravidian and Indo-Aryan, has contributed importantly to our 
knowledge on the early period of language contact, both in his 
Paper reprinted here and in others listed in general bibliography, 


The close contact and widespread bilingualism among [ndo- 
Aryan and Dravidian-speaking groups in the Vedic Period seems 
well-established by the evidence presented by KUIPER and 
EMENEAU (see also Emeneay 1956, 197] P). The fact that 
Dravidian influence appears in the Rigveda, the oldest and most 
sacred text of Hinduism, Strongly suggests that this contact e a 
= a cae of the Society, as well as Possibly on ae 
evels, "$ new evidence on 
evidence of the Pre-Aryan Origin of cob eee ee 
that the institution existed in the early Indus Valley civilben de 
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Various pieces of linguistic evidence point to a discontinuity between 
the original contact situation, which seems to have remained stable 
over a long period of time, and a new period of contact centered 
in the Gangetic plain. EMENEAU’s discussion of the “uneven 
disintegration” of the semantic structure of the PDr. *-um suffix 
leads him to postulate a ‘‘differential extinction of Dravidian in 
different sub-areas, earlier in the Ganges valley...”". SOUTHWORTH, 
on the basis of entirely different evidence, suggests that the 
Dravidian component was much less significant (possibly from the 
earliest times) in the population of the Ganges valley than it was 
in the western areas. Putting these two pieces of evidence together 
is not difficult, and they begin to add up toa fairly coherent picture. 
Clearly, we are still at the beginning in the creation of this picture. 
But these few examples may give some notion of the value of this 
type of evidence. 


(7) Social factors in internal convergence 


The persistence of diglossia, extensive bilingualism, and in 
recent years, processes such as urbanization have been important in 
promoting convergence. These processes have led to changes in 
social structure, and increased migration from rural to urban areas- 
either within the same linguistic region, or into other regions. Such 
changes have affected many stable diglossia situations, and have 
created multilingual settings which require constant code-switching. 
Such changes are illustrated in this volume by ULLRICH and DE 
SILVA. According to Ullrich, the Havik Brahmins did not need 
facility in colloquial and/or literay Kannada dialects when they 
primarily interacted with each other. However, because of moder- 
nization resulting in literacy growth, increased communication among 
members of different castes, and the expansion of occupational 
domains for the Haviks, isolation of the Havik community is break- 
ing down. This in turn seems to create a situation of dialectal 
convergence. Thus here is a situation in which speakers of one 
prestige dialect, because of their growing contact with speakers of 
other dialects, have gradually started showing morphological coexi- 
stence of two different sub-systems in their speech. ‘This is merely 
the first step towards further convergence perhaps leading to the 
adoption of the standard dialect, or some variation of it. 


A somewhat similar situation occurs for Sinhalese too, although 
there the spread of education and popularization of literature has 
resulted not in change towards the ideal model, but in the hybridiz- 
ation of the normative literary and colloquial variety of Sinhalese. 
However, this slightly different situation still supports the hypothesis 
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stated above, that modernization and social change lead to conver- 
gence and hybridization. The earlier highiy stratified social situation 
perhaps helped to retain the stable diglossia situation for many 
regions in South Asia, although even here as De Silva has tried to 
show, while the diglossia situation continues, there is constant 
hybridization between the different varieties of both the high and the 
low forms of speech. Thus we have two situations, one of stable 
diglossia and the other of constant hybridization, which seem to 
counter-balance each other. 

A question which needs to be raised in this comnection is the 
following: How will mass education and awareness of the standard 
variety affect the contact-convergence situation, especially the kind 
that Gumperz (1971) has described? Part of his argument is 
that the kind of Urdu, Marathi, and Kannada that people in 
Kupwar speak is quite different from the standard varieties for all 
three languages. Although he claims that the literacy may not 
change the pattern established over many years of having a common 
syntactic code for all three local varieties. What is needed is some 
degree of higher education whereby the speakers of the individual 
languages may begin to realize the stat us-significance of the standard 
variety and may try to acquire it. How will this process affect the 
existing situation ? 

In cases where convergence arises because of the migration 
of one group into another region, the possibility just described is 
not likely to occur. For example, it is unlikely that the Saurashtri 
speakers in Tamilnad will begin to imitate Saurashtri spoken in 
Gujarat. Recent fieldwork by Apte in a somewhat similar situation 
in Tamilnad with regard to the Marathi-speaking migrants indicates 
that the minority community may not at all be anxious to learn 
the standard variety of their native language. 


(8) Genetic classification 


The question of genetic classification of various pidgin and 
creole language has been discussed extensively in the lisht of available 
descriptions of such languages. However it has not “been answered 
satisfactorily. In the South Asian context, we have examples of 
convergence occurring not only between languages within the same 
family, but between languages of different families (BENDIX 
PANDIT). How are these varieties to be classified ? Numeruos 
solutions, suggested for other cases, come to mind. In the Sanur 
case, it may be feasible to call it a dialect of Tamil, althoush 
Saurashtri speakers would certainly find it difficult to accent su *; 
classification. What about the Newari-Nepali situation? In ne 
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case, the question is not pressing, because the convergence has not 
resulted in such great change. In the case of Bombay Hindi-Urdu 
described by APTE, the problem does not really arise since both 
Hindi and Marathi are IA languages. 


The more basic question to be raised in this connection 
however, is this: What is the relevance of such genetic classification 
to cases of convergence in general? Inthe nineteenth century, such 
genetic classification was undertaken primarily to trace the origins 
of various languages for the purpose of historical reconstruction. 
However, such classification does not necessarily explain anything. 
As Hymes puts it (1971:80) “After all, if to place a pidgin or creole 
genetically sufficed, there would beno special subject of pidgin and 
creole. In fact, genetic classification explains no more than a 
fraction of the history and formation of a language...”’. 


Therefore one solution would be simply to reject genetic 
classification as not relevant to situations of the kind described in 
this volume. Alternatively, classification could be based either on 
geographic area, or on typological factors such as the basic order 
typology suggested by Greenberg (cf. MASICA). One goal of this 
volume has been precisely to show how the SA languages seem to 
become more and more alike, although historically speaking, in 
terms of their origin, or in terms of what they have retained, they 
belong to four different language families. Yet another way of 
solving the problem would be to develop modes of functional 
classification as suggested by Hymes which take ‘“‘as basic the ways 
in which linguistic means are organized for social ends, and extending 
the scope of classification from individual lines of historical trans- 
mission to the linguistic repertoires of persons and communities.’’ 


(Hymes 1971:83). 


(9) Language and dialect 

Finally, the papers in this volume lead us to a reassessment 
of a basic problem in linguistics, namely definition of the concepts 
‘language’ and ‘diaJect’ and the relationship between them. It has 
been obvious now for some time that the terms are most often used 
in the non-linguistic, i.e. sociocultural sense, since there is no unani- 
mity among linguists about the precise application of these terms, 
and no acceptable linguistic definition of them. What emerges 
from the discussion of various papers in this volume is the realization 
that contact and communication are two important factors which 
shape the destiny of many ‘languages’ in South Asia. Thus mutual 
intelligibility, appears to be the goal and/or the result, either uncon- 
scious or conscious, in the cases studied. Therefore it is probably 
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useful to employ those terms which treat this criterion as the basis 


of definition. 
In this connection it is worthwhile to examine two terms 


‘L-simplex’ and ‘L-complex’ suggested by Hockett (1958 : 323 24). 
His primary criteria for determining what is L-simplex and what is 
L-complex is mutual intelligibility. Thus he talks about sets of 
idiolects which he calls L-simplex or L-complex, depending on how 
they are related to each other in terms of mutual intelligibility. His 
argument is that the relationship of these two sets to the conventional 
language label is varied. In the South Asian context, it is very 
feasible to locate numerous sets of L-simplexes which can then be 
classified into fewer number of L-complexes through chains of 
mutually intelliglble idiolects. Such sets of course may criss cross 
over the whole area, ignoring the traditional language and language- 
family boundaries. Such classifications would definitely bring out in 
a better way the closeness and continuity of South Asian language. 
The point has been argued before about the existence of extended 
chains of interelated idiolects and dialects, not only for the major 
regions of Hindi and other IA languages, but also for border areas 
of IA and Dravidian languages (Gumperz 1962, Apte 1970). 


Thus while using the terms L-simplex and L complex for 
grouping the speech varieties in all regions of South Asia, the term 
‘language’ can be applied only ina sociocultural sense, to refer to 
that particular variety recognized as the ‘standard’, and also to 
the written version which represents that standard variety. A 
Separation along these lines is necessary so that linguists can use their 
terms with much more rigor without involving all the non-linguistic 
connotations which tag along with the usage of the terms ‘language 
and dialect’. The suggestion made here of reviving Hockett’s terms 
is merely one of the many alternatives linguists ought to consider 


in studying the complicated situations of contact and convergence 
among languages. 


Apart from its theoretical importance, this question has certain 
important practical implications for communication, as well as for 
concepts of national or regional unity in South Asia. Many obser- 
vers of the area are struck by the extent to Which converged and 
pidginized varieties of SA languages, however important for purposes 
of practical communication, are considered inferior by educated 
members of the society. This point has often been made by linguists 
but mainly in talking to each other. Hopefully future stndy of 
this phenomenon will invoive some attempt to educate other members 


of the society about the communicative potent; 
nt 6é > 
forms of speech, P lal of these “impure 
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NOTES 


This is a composite introduction, prepared by consolidating 
two independent originals. The authors wish to express their thanks 
to Professors Edward H. Bendix and Joan P. Mencher, who offered 
useful criticism and comments. 


1. InSouth Asia we include (primarily) India, Pakistan, Nepal, 
Sikkim, Bhutan, Bangladesh, Srilanka (formerly Ceylon), the 
Laccadive and Minicoy Islands, and (secondarily ) Afghanistan, 
Tibet, Burma. 


2. In fact, a native Tamil speaker who assisted with the collection 
of Saurashtri data achieved a higher level of fluency in 
Saurashtri than a native speaker of Gujarati did in the same 
amount of time (P. B_ Pandit, personal communication). 


3. Though Southworth (informal observation) notes that the 
“Urdu”? used by some uneducated southern Muslims, and 
which they report as their home language, shares many features 
with Bombay Hindi-Urdu. 

4. Note, however, that there must have been reversals of the 
direction of convergence in some cases. Gumperz & Wilson 
are able to say that Marathi is the least changed of the Kupwar 
varieties, because they are comparing Kupwar Marathi with 
‘standard’? Marathi, which (as Southworth 197la shows) 
has already been strongly influenced by Dravidian over a 
period of many centuries. 
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CONVENTIONS USED 
The attention of readers is called to the following: 


(1) Bibliographical references follow the format of the journal 
Language. However, we have omitted all diacritical marks ftom 
the bibliographies, in the interest of reducing errors in proo- 
freading. 

(2) The general bibliography is intended to cover primarily South 
Asian linguistics and questions of linguistic convergence. There 
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(3) 


(4) 


(5) 


(6) 
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fore, we have excluded from it those items which do not appear 
to bear on these subjects. These will be found in the lists of 
references following the papers in which they are referred to. 
Such references are followed by *in text (e.g. Sauer 195293; 
All references not followed by *will be found in the general 
bibliography. We note here that Masica’s paper contains some 
useful references on cereals in Asia and Africa; De Silva’s paper 
gives some references pertaining to current trends in Sinhalese 
literature (many of them in Sinhalese). 

The possibility of consensus being remote, we have made no 
attempt to normalize transcriptions of linguistic forms. 

We use the term ‘Hindi-Urdu’ to refer to the vernacular 
language of North India, though not for its literary or normative 
variants, are usually referred to as ‘Hindi or ‘Urdu’. Cf. foot- 
note no. 2 to Apte’s paper. 

In the introduction, we have used the names of authors in 
capitals (e.g. APTE, BENDIX) to refer to the papers in this 
volume. 

The following abbreviations have been used: 


SA = South Asia, South Asian 

IA = Indo-Aryan 

OIA = Old Indo-Aryan 

MIA = Middle Indo-Aryan 

NIA = Modern Indo-Aryan 

IL = indigenous (i.e. pre-Indo-Aryan) language (s) 
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PIDGINIZATION OF A LINGUA FRANCA: 
A LINGUISTIC ANALYSIS OF HINDI-URDU SPOKEN IN BOMBAY} 


Mahadev L. Apte 


Introduction 


Although the interest in pidgin and creole languages dates 
back to the late nineteenth century, the increasing emphasis on the 
study of these languages in recent years is one of the many phases 
of the considerable attention being paid by social scientists to problems 
of communication in culture-contact situations in various parts of 
the world. Such an emphasis also reflects the new awareness of the 
importance of pidgin and creole languages in the development of 
sociolinguistic theory. 


The purposc of this paper is to present a preliminary linguistic 
analysis of Hindi-Urdu? as spoken inthe city of Bombay and its 
Vicinity by acertain segment of the population, and to discuss the 
characteristics of this speech in the context of various theoretical 
developments pertaining to pidginization and creolization of languages 
in linguistics, anthropology, and sociology. Such development have 
occurred because of the attempts to answer many questions such as: 
What is the nature of the communicative process involved in langu- 
age-contact situations in general; what diagnostic criteria shoule be 
used in the genetic classification of these languages; what is the 
importance of language function in determining the precise nature of 
the processes of pidginization and creolization; and what is the rel- 
evance of socio-cultural factors to the analysis of pidgin and creole 
languages. These theoretical developments have occurred because 
of the awareness that the traditional synchronic and diachronic 
linguistic theory as developed in the last four decades is perhaps not 
quite adequate to satisfactorily resolve the problems of terminology, 
of methodology (including analytical techniques), of taxonomic 
models for genetic classification of languages, and of the effect of 
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socio-cultural factors on the development of pidgin and creole langu- 
ages (Hymes 1971). 


Background 

The nature of linguistic diversity in India has long 
primary interest to scholars.? The linguistic situation in India can 
be very briefly described as follows. 


been of 


Out of the multitude of languages and dialects belonging to 
four families (Census 1961, 1971), thirty-three are each spoken by 
overa million speakers. Of these, thirteen languages including 
“Hindi” and “Urdu” are used for administrative purposes by state 
governments. ‘Hindi’’ is spoken by the largest segment and is the 
official language of the central government as well as of five state 
governments. 


Within each major regional language there are speech varieties 
based on such factors as geography, age, sex, education, caste, 
religion, occupation, and residence. 


Notwithstanding the political debate about the status of 
‘Hindi’ as a lingua franca, it is a fact in the non-Hindi-Urdu regions 
of India, numerous speech varieties labelled Hindi, Urdu, or 
Hindustani are spoken. The variation occurs not only across regions, 
but also along the socio-economic matrix, and along the urban/rural 
continuum. Inthe native Hindi-Urdu speaking area itself, there 
are varieties based on factors mentioned above. While English is 
preferred by the elites as a lingua franca for pan-Indian communi- 
cation, both Hindi-Urdu and English in their many varieties are 
used side by side to communicate across language barriers. 


It is however in the big multilingual urban centers, mainly 
outside the Hindi-Urdu region, such as Bombay, Calcutta, Ahmeda- 
bad, Madras, Bangalore and Hyderabad that a continum of levels 
in the proficiency of Hindi-Urdu is readily noticed. Restricting our 
comments to Bombay and its vicinity, we need to recognize two 
distinct speech varieties in the continuum. These varieties are 
socioculturally and socioeconomically oriented as has been the case 
in Jamaica (De Camp 1971:28). 


The first speech variety labelled here as Jevel / Speech is 
spoken by persons who are educated, who come from the middle or 
upper socioeconomic Strata of the population and who learn lanugage, 
be it “Hindi” or “Urdu”, formally, either in school or in various 
language institutions. Political leaders, persons with white collar 
jobs and teachers, among others use Jevel I speech, 
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Speakers of this variety are conscious of the grammatical 
structure of the language and many of them are proud of the fact 
that they know all the exceptions to the rules. Their vocabulary 
is fairly extensive and is often Sanskritized depending on whether they 
think they are learning ‘“‘Hindi’’ or ‘“‘Urdu’’. They can read and write 
either Devanagari, if they learn ‘‘Hindi’’, or the Perso-Arabic 
script, if they learn ‘“Urdu’’. For many of them, especiaily for 
teachers, speaking and writing ‘“‘correct Hindi’’ or “‘correct Urdu’’- 
correct according to the recognized and well-established grammatical 
standard —-— is a status symbol. 


The use of this variety, however, is restricted primarily for 
formal occasions or for writing purposes [tis rarely used in in- 
formal day-to-day interaction with others. For such informal 
occasions, the mother-tongue or often English is used. Teachers use 
this variety in the classroom for lecturing. They are purists and 
have mastered acomplete command of the grammar. They usually 
insist that students should know all the grammatical structure. 
Others use it primarily for political speeches or other public functions 
and at large gatherings in the city where the audience is likely to be 
multilingual. 


The second speech variety labelled here as level II speech 
is Hindi-Urdu spoken by persons who are not well educated, come 
from the low socio-economic strata of the population, and either 
have such non-white-collar jobs as porter, taxi-driver, hawker, 
waiter, servant, etc, or are petty shop-keepers. Since people in 
this group depend extensively for their occupational security on the 
ability to communicate with others of different linguistic back- 
grounds, there is a greater emphasis on communication than on 
grammatical ‘‘niceties’”. Individuals in this socioeconomic and 
occupational strata use Hindi-Urdu extensively in their daily routine. 
It appears that those who use level II speech have a high degree of 
motivation and no inhibition in using Hindi-Urdu. Yet they have 
a minimal degree of grammatical mastery. Despite this lack of 
grammatical proficiency, these speakers accept that what they speak 
is Hindi-Urdu. It is this speech variety that will be discussed in 
the following section, It seems that as we move from level II, the 
ratio of the variable “frequency of usage’? to that of “‘grammati- 


cality’’ is inverse. 


It should be noted that the developmeot of this type of 
Hindi-Urdu in the urban settings in India has not gone unnoticed. 
As early as 1931, S. K, Chatterjee, a prominent linguist, described 
the salient features of Hindi-Urdu - - he called it Hindustani - — as 
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tropolis of Culcutta. 
and although 
in other big 


spoken in the business districts of the great me 
Unfortunately, there was no follow-up of his efforts, 
scholars are aware that similar speech varieties exist 
urban centers, they have not yet been analyzed. 


Chatterjee based his analysis on his own observations and 
also on published materials, primarily booklets of songs and plays 
used by the dancing and singing groups in the market places of 
Calcutta (1931: 194). He refers to Bombay Hindustani a few times 
in his paper, but does not elaborate on its characteristics. 


My own analysis is based on material collected sporadically 
during 1963-65. Itis not sufficient for presenting an exhaustive 
linguistic analysis. I hope however, that it will be supplemented by 
others based on extensive data collected'in a more systematic manner. 
Despite the nature of my data, the similarities in the Hindi-Urdu 
speech of Bombay and the Hindustani speech of Calcutta in 1931 are 
obvious and will be mentioned following the linguistic analysis of 
Bombay Hindi-Urdu. 


Linguistic Analysis 


The analysis presented below includes specific statements about 
the phonological, grammatical and lexical peculiarities of Bombay 
Hindi-Urdu /evel IJ speech. It also includes discussion of the simpli- 
fication of structure, and of new constructions patterned after 
Marathi. It is worth noting that most of the peculiarities of this 
speech occur at the grammatical level. Examples are given to 
illustrate the statement. The following abbreviations are used. HU 
for Hindi-Urdu, BHU for Bombay Hindi-Urdu and M for Marathi 
An idiomatic English translation in single quotation marks precedes 
the examples. The literal English translation generally follows the 
BHU version. 


Phonology 


_At the phonological level, the only noticeable features are 
substitution of the Hindi-Urdu vowel phonemes € and 9 by the 


diphthongs ai and au respectively, and substitution of vowel sequ- 
ences de by the diphthong 4y. 


1. ‘I shall go there tomorrow’ 


HU m¢ kal waha jaun- ga 
may kal waha jay- ga 
I tomorrow there go will 
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2. ‘we drink tea’ 


HU ham cay pite he 
BHU ham cay pita hay 
we tea drink 


3. ‘Come at nine o’clock’ 


HU no baje aiye 
BHU nau baje aw 
nine o’clock come 
Grammar 


Nouns: The primary feature of nounsin BHU is the absence 
of their plural and oblique forms. In HU both masculine and 
feminine nouns ending in different vowels and consonant form the 
plural either by replacing the final vowel by another, or by adding a 
vowel. For example Masc. sg. kamra ‘room’ pl. kamre; Fem. sg. 
dukan ‘shop’ pl. dukané; Fem. sg. dawa ‘medicine’ pl. dawaé; Fem. 
sg. ladki pl. ladkiya. In BHU the singular forms are used most 
frequently. Only the plural form of masculine nouns ending in 4 is 
occasionally used. Often the context makes it clear whether a singular 
or plural form is used. Occasionally an adjective of quantity such 
as bahut ‘many’ or sab ‘all’ is used. If anumeral precedes the noun, 
there is no ambiguity. 


4, <All shops are closed’ 


HU sab dukané band he 
BHU sab dukan band hay 
All shops closed are 
5. ‘I have three sons’ 
HU mere tin bacce he 
BHU apne-ku_ tin bacca hay 
me to three boys are 


In HU postpositions are added to the oblique forms of nouns. 
Thus there are singular and plural oblique forms of nouns, e. g. 
kamra ‘room’, kamre-mé ‘in the room’ kamr6-mé ‘in the rooms’; 
dukan ‘shop’ dukand-mé ‘in the shops’; Jadkiyd-ko ‘to the girls’ 
etc. In BHU the distinction between nominative and oblige forms 
is obliterated and nominative forms are used with postpositions. 


6. ‘Let all boys go’ 
HU sab ladk6- ko nikal do 
BHU sab ladka- ko nikal dew 
all boys to remove imperative 
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7. ‘He will come next week’ 


HU wo agle hafte- mé ae- ga 
BHU wo agla hofta- me ay- ga 
he next week in come will 


HU has a two gender system and every noun, whether animate 
or inanimate must be either masculine or feminine. M has a similar 
three gender system. Although there are many cognates between 
M and HU, they belong to different genders. This creates a problem 
for BHU speakers. Gender markers are also important in HU and M 
because of the concordance of verbs and adjectives with the nouns 
for gender. The BHU usage of noun does not coincide with that of 
HU. Many feminine nouns in HU are used as masculine in BHU. 
HU feminine nouns which are cognates of M neuter nouns are used 
as masculine nouns. For example dukan ‘shop’ which is neuter in 
M is used as masculine in BHU although it is feminine in HU. So 
are pustak ‘book’, dawa ‘medicine’, chat ‘ceiling’ and others. If HU 
feminine nouns have masculine cognates in M, they are used as 
masculine nouns in BHU. For example the HU feminine noun 
Kamiz ‘shirt’ is used as masculine in BHU because the comparable 
nounsin M is masculine. Similarly are used other nouns like 
baniyain ‘undershirt’, cay ‘tea’ etc. 


Pronouns: HU has three second person pronouns tu, tum, 
and ap. Of these only one, tum, is frequently used in BHU; tu is 
only occasionally and 4p is rarely used. From among the first person 
pronouns mé¢ and ham, the plural pronoun hymn is used much more 
frequently in BHU, often in the sense of first person singular. On 
the other hand, the third person singular pronoun wo is much more 
common than the plural form we irrespective of the number of 
persons or objects referred to by the speaker. 


In HU, postpositions are added to the Oblique forms of 
pronouns as is the case with nouns. However, different oblique 
forms occur before different postpositions and case suffixes. For 
example the oblique forms of the first person pronoun me are 
mujh- before dative and ablative case Suffixes and mer- before 
genitive case suffixes. The form mer-e occurs with some other post- 
positions such as sath, liye, pas ete. The comparable rank for 
second person singular pronouns are tujh-, ter- and ter-e and th 
plural forms are tum, tumhar- and tnmhar-e respectively. In BHU 
the forms before dative and ablative Suffixes for these i 
mere, tere, tum and tuma-re. ibe se 
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8. ‘I don’t know’ 


HU mujhe/mujh- ko malum nahi (he) 
BHU mere- ko malum nahi 
me to known not 


9. ‘Don’t talk to me’ 


HU mujh- se bat mat. kyive 
BHU mere- se bat mat karo 
me with talk don’t make 
10. ‘Do you know this?’ | 
HU kya ap- ko yah malum he ? 
BHU kya tere- ko ye malum hay ? 
or 
kya tum- ko ye malum hay ? 
What you to this known is 


11. ‘I will not talk to you’ 


HU me ap- se bat na karunga 
BHU: ham tere=- se bat kare ga nahi 
I you with talk make will not 


In HU when a case suffix or a postposition is added to a 
masculine noun which is modified by a preceding genitive pronominal 
modifier, both the noun and the pronominal modifier occur in oblique. 
forms, e. g., mer-a ladka ‘my son’, but mer-e ladke-ko ‘to my son’; 
tumhar-a kamra ‘your room’, but tumhar-e kamre-me. In the 
comparable BHU constructions, the noun occurs in the nominative 
form (see nouns above) and the pronominal form does not change. 


12. ‘What happened to your son ?’ 
HU tumhar-e ladke- ko kya huwa ? 
BHU tumar-a ladka- ko kya huwa? 
your boy to what happened 


13. ‘There are many persons in our home’ 


HU hamar-e  ghar- mé _ bahut log he 
BHU hamar-a ghar- me bahut log hay 
our house’ in many persons are 


The oblique forms of the third person pronounsin BHU are 
the same as in HU, e.g., is for yah (ye), ‘This he, she, it’, ws for wah 
(wo) ‘That he, she, it’, in for ye ‘These’ and wn for we ‘Those’. 
However, the plural oblique forms in and un are much less frequent 
than the singular forms even when more than one person or object is 


being referred to. 
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The oblique form apne of the HU reflexive pronoun apna is 
often used in BHU in the place of the first person plural pronoun 
ham and its oblique form. 


14. ‘We don’t know’ 


HU hamé malum nahi (h¢) 
BHU apne- ku malum nabi : 
we to known not (is) 


15. ‘We waat tea’ 


HU hamé cay cahiye 
BHU apne- kus cay manta 
we to tea want 


From this use of the oblique form of the reflexive pronoun, a 
new first person pronoun apan is used in BHU. This appears to be 
the influence of Marathi which has a first person inclusive/exclusive 
plural pronoun apan. 


16. ‘We will not go’ 


HU ham nahi jaenge 

BHU apan jay- ga nahi 

M apan jan- ar nahi 
I/we go will not 


However, the reflexive pronouns as such are not used in BHU. 
17. ‘He is in his (own) shop’ 


HU wo apni dukan- mé he 
BHU wo uska dukan- me hay 
he his shop in is 


18. ‘Call your son’ 


HU (tum) apne bacce- ko bulao 
BHU (tum) tumhara _ bacca - ku bulaw 
you your son to call 


Adjectives: In HU some adjectives are variable because the 
final segment of the adjective changes according to the gender and 
number of the noun following. Such variable adjectives also have 
an oblique form which precedes nouns with postpositions. In all 
comparable BHU constructions the masculine Singular form is used, 
19, ‘Please go into the third room’ 

HU ap tisre kamre- mé jaiye 

BHU tum tisra kamra- me jaw 

you third room in go 
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20. ‘He will come next week’ 


HU wo agl-e hafte- mé ae- ga 
BHU wo agl-a hafta- me ay- ga 
he next week in come will 


The third person pronominal forms which also function as 
demonstrative adjectives when preceding nouns in HU also follow the 
Same pattern as variable adjectives with regard to the occurrence of 
oblique forms, But in BHU the oblique forms are not used. 


2). atch that man’ 


HU us admi- ko pakdo 
BHU wo admi ko pakdo 
that man to catch 
22. ‘What happened to this boy ?’ 
HU iS ladke- ko kya huwa 
BHU ye ladka- ku kya huwa 
this boy to what happened 


Verbs: The paradigms of verbs in various tenses in HU are 
generally simplified in BHU and only one form in each tense is used. 
The irregular forms are rarely, if ever used, except those of the most 
frequent verbs, such as jana ‘to go’. Even for this verb, from its 
future tense paradigm jaunga/gi, jaenge/gi, jaoge/gi, jaega/gi, only 
one form jaega-and that too in the shape of jayga-is used. This is 
also true of the imperfect and perfect tenses of this verb for which 
are used the forms jata and gaya respectively. 


23. ‘What will you take ?’ 


HU tum kya lo- ge? 
BHU tum kya le- ga? 
you what. take will 
24. ‘We live here’ 
HU ham yaha rahte h¢ 
BHU ham yaha rahta hay 
we here live 


25. ‘We will not come’ 


HU ham nahi aenge 
BHU ham ay- ga nahi 
we come will not 


26. ‘Where had you gone yesterday ?’ 


HU tum kal kaha gaye the 
BHU tum kal kaha gaya tha 
you yesterday where gone had 
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What is true of the main verbs is also true of the auxiliary 
verb hona ‘to be’ of which only one form is used in each of the 
imperfect, perfect, and future tenses. In the imperfect tense the 
singular plural distinction indicated by the nasalization in HU does 
not occur. The form hay is used with any subject; the perfect 
form is tha and the future form is hoga. 


21. “YOU areicrazy. 


HU tum pagal ho 
BHU tum pagal hay 
you crazy are 
28. ‘We are all poor’ 
HU ham sab garib he 
BHU ham sab garib hay 
we all poor are 
29. ‘They were very unhappy’ 
HU we bahut naraj the 
BHU we bahut naraj tha 


Occasionally, the future tense meaning of a verb is indicated 
in BHU by the addition of the suffix wa/a to the infinitive oblique 
form of the verb although such constructions do not occur in HU. 


30. ‘I will do this work tomorrow’ 


HU me kal ye kam karun- ga 
I tomorrow this work do will 
BHU may _ kal ye kam karne-wala hay 
I tomorrow this work  do-er am 


In BHU a verb for which there is no equivalent in HU is often 
used, This is manna which is semantically equivalent to the HU 
verb cahna ‘to want’. It is possible that this verb was derived from 
the transitive verb mangna ‘To ask for, to request etc.’. However 
its usage appears parallel to cahna. 


31. ‘What do you want ?’ 


HU apko kya cahiye 
BHU tum- ku kya manta hay 
you to want what is 


The auxiliary verb hona ‘to be’ is often used in BHU in the 
sense of ‘to want’. This seems to be an independent development 
since there is no such usage and extension of meaning in HU. 
32. ‘What do you want’ 

BHU tum- ku kya hona 

you to what want 
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33. ‘I want a cup of tea’ 
BHU apne- ku ek 
I to one 


The comparable construction in HU for the above sentences 
are the following. 


hona © 
want 


Kap 
cup 


cay 
tea 


HU ap- ko (tumhe) kya cahiye 
you to (you to) what is wanted 
HU mujhe (hamé) ek kap cay cahiye 
| me to (me to) one: < cup tea is wanted 


Emphatic Particles and Conjunctions: Some emphatic parti- 
cles and conjunctions used in BHU do not occur in HU. However, 
very similar forms occur in Marathi and it is reasonable to assume 
that BHU has adopted them from Marathi. Among the emphatic 
particles, ca which is equivalent to Ai in HU, is commonly used. 
The conjunctions most frequently used are pan ‘but’ and ki ‘or’, 
and are comparable to /ekin and ya in HU. 


34, ‘But he will not come’ 
HU lekin wo nahi ae— ga 
BHU pan wo ay- ga nahi 
M pan to yen— ar nahi 
but he come _ will not 
35. ‘He alone will go’ 
HU wo hi jaega 
BHU woc jayza 
M toc jail 
36. ‘You certainly must have some money’ 
HU ap-ke pas kuch paise to hoge hi 
BHU tumara pas kuch paise hogac 
you near some mony must have 
37. ‘Do you want to go or not?’ 
HU tumhé jana he ya nahi (he) 
BHU tumku jane ka hay ki nahi 
M tumha- 1a jayca Ahe kd nahi 
you to going is or not (is) 


Some syntactic features: In HU the negation of action is 
indicated by the negative particle nahi ‘not’ for imperfect, present 
progressive and future tenses, while the particle na ‘not’ indicates it 
for imperative and optative. In all such constructions the negative 
particle always occurs before the main verb, and the auxiliary 
verb, if it occurs, can be deleted optionally. In BHU however, the 
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tendency is to use the negative at the end of the sentence. 
influence of Marathi in which the negative 


perhaps due to the 


auxiliary verbs occur in sentence final position, especia 


Apte 
This is 


lly the verb 


nahi ‘not to be’ which is quite similar to the HU particle nahi. 


38. ‘He will not come’ 
HU wo nahi ae- ga 
he not come 
BHU wo ay— 
M to yen- 
he come 
39. ‘We don’t want any tea’ 
HU hamé cay 
BHU apne-ku cay 
M aplyala caha 
us to tea 
40. ‘We don’t want to go’ 
HU ham nahi 
we no 
BHU hamku jane-ka 
M amha la 
us to 


will 


ga nahi 
ar nahi 
will not 
nahi cahiye 
mangta nahi 
nako 

not wanted 

jana cahte (he) 
going want 
nahi 

jayca nahi 
going is not 


In HU sentences where the transitive verbs occur in the past 
tense the subject nouns or pronouns occur with the agentive suffix ne. 
In BHU however, if the subject is in the first or second person, the 
agentive suffix is often omitted. This is also probably due to the 
influence of Marathi which has similar constructions but in which 
the first and second person pronoun do not take an agentive suffix. 


41. ‘We saw him’ 
HU ham-ne use 
BHU ham usku 
M amhi tyala 

42. ‘Why didn’t you do this work ?’ 
HU tum-ne ye kam 
BHU tum ye kam 
M tumhi he kam 

you this work 


dekha 

dekha 

pahyla 
kyo nahi kiya 
kay-ko kiya nahi 
ka kela nahit 
why did not 


Although nouns are modified in HU by adjectival forms 
derived by adding the genitive suffix -ka, ki, -ke to the oblique © 
infinitive verb forms, the modifiers do not occur as predicatives in 
constructions with the verb to be. In Marathi on the other hand, 
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such constructions abound. In these constructions the subject is in 
the dative case. In BHU such constructions occur frequently and 
it is reasonable to assume that BHU is influenced by M in this respect. 


43. ‘We want to drink tea’ or ‘We intend to drink tea’ 


BHU ham- ku cay pine- ka hay 
hamare- ku cay pine- ka ahe 
M amha- la caha pyay- ca ahe 
we to tea drinking of is 


The semantically comparable HU utterance would be the 
following: 


HU hamara cay pine ka irada he 
our tea drinking of intention is 
or 


hamara irada cay pine-ka he 
Another HU construction similar to the one above is: 
HU hame cay pina he 
we to tea drinking is 


A question is formulated in BHU from constructions similar 
to the one above by adding a question particle in the sentence final 


position. 
44. ‘He wants to see a movie’ 
BHU us- ku sinema dekhne- ka hay 


He to movie seeing of iS 


45. ‘Does he want to see a movie?’ 
BHU us- ku sinema dekhne- ka hay kya 
Here too the Marathi influence is obvious since such questions 


are formulated in Marathi by the same technique whereas in HU 
the question particle occurs in sentence-initial position. 


45a. ‘Does he want to gotoa movie?’ 
M tya-la sinema-la jay-ca ahe ka? 
45b. HU kya us-ka sinema dekhne-ka irada he? 
In certain types of complex constructions in HU, the principal 
clause consists of an adverbial particle and the verb ‘to become’, 
while the sub-ordinate clause explains the event. The principal 
clause occurs initially. In BHU similar constructions occur, but 
with the addition ofa pronominal with a genitive suffix and a 
demonstrative. The BHU construction is patterned after that of 
Marathi. 
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46. ‘It so happened that we did not go home yesterday’ 
HU csa huwa ki ham kal ghar nahi gaye 
BHU _us- ka ye ays huwa_ ki ham kal ghar gaya nahi 
M tya- ca he asa jhala ki amhi kal ghari gelo nahi 
that of it so happened that we yesterday home went not 
One type of question-utterance is formulated in BHU fa 
adding a genitive suffix to the question particle kitna ‘how much , 
There is no parallel construction in HU, but the same meaning can 
be expressed another way. 


47. ‘How much does this cost ?’ 


BHU ye kitpe- ka hay 
this how much of is 
HU is— ka dam kya he 
this of price what is 
Vocabulary 


BHU includes words from both HU and M. Since most Indian 
languages have many lexical items in common, BHU speakers 
frequently use words from other languages in their speech. The 
common vocabulary pool of the Indian languages and the tendency 
to use words from languages other than HU has prevented the 
creation of new words in BHU. 


Among the non-HU words used, those from Marathi seem 
more numerous. Many cognates are found in Hindi-Urdu and 
Marathi where Marathi retroflex consonants, expecially n and] 
correspond to Hindi-Urdu dental consonants. For example, M karan 
‘cause’, HU karan; M bolne ‘to speak’ HU bolna; M galti ‘dripping’ 
or ‘mistake’ HU galti; M nila ‘blue’ Hu nila; M kala ‘black’ HU 
kala etc. On the basis of this pattern many Marathi words are used 
as Hindi-Urdd words with a change from the retroflex to the dental 
articulation for the nasal and lateral consonants. For example, BHU 
palna from M palne ‘to run’; BHU maran from M maran ‘death’; 
BHU zal from Maal ‘tap’ etc. Often many of these words do not 
occur in Hini-Urdu. 


Many of the characteristics features of BHU described above 
also occurred in Calcutta Hindustani as stated by Chatterjee in 1931. 
For example, the following features seem common to both. (The 
page references are to Chatterjce’s article.) Substitution of the 
vowels ¢ and o by diphthongs (196); lack of certain plural forms of 
nouns (204); lack of differentiation between nominative and oblique 
forms of nouns (205); lack of the obliqe form of the variabie 
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adjectives (207); and reduction in the verb paradigms for various 
tenses (212-213). 


Thus, despite the influence of the dominant language of the 
region of each of these speech varieties, that of Bengali on Calcutta 
Hindustani, and that of Marathi on Bombay Hindi-Urdu, it is 
obvious that when Hindi-Urdu, itis obvious that when Hindi-Urdu 
is used in multilingual settings of big urban centers by speakers of 
different linguistic backgrounds, the process of simplification runs a 
parallel course. 


Discussion 


As mentioned in the introduction, the Bombay Hindi-Urdu 
speech described here appears to share many characteristic features 
of the languages generally labelled as pidgins and creoles. A survey 
of the extensive literature on pidgin and creole languages suggests 
that the following features are considered to be characteristic of 
these languages: (1) They are contact languages; (2) They 
generally emerge in situations requiring communication between 
person or groups of people of different linguistic backgrounds; 
(3) Their primary function is communication for trade or other 
similar activities, thus there is reduction in function; (4) They are 
generally a mixture of two languages; (5) Their overall structures 
are simpler than either of the two languages of which they are 
mixtures, in other words there is reduction in, or simplification of, 
structure; (6) They have limited vocabularies; (7) They are not 
anyone’s mother- tongue. [This is of course not the case with 
pidgin languages which later become creoles (Hall 1966; Hymes 1971; 
Le Page 1961; Reinicke 1938; Samarin 1955, 1958. 1971; Taylor 1959).] 

The theoretical questions which need to be raised in connection 
with the Bombay Hindi-Urdu speech are the following: (1) Are 
there any special linguistic and sociocultural features of Bombay 
Hindi-Urdu which sets it apart from other pidgin or creole langu- 
ages, and what plausible explanations can be provided for these 
features; (2) Should Bombay Hindi-Urdu be considerd a process 
or a Stable final product of pidginization; and (3) What descriptive 
label can be used for Bombay Hindi-Urdu and howcan it be classi- 
fied. 


Special Features of Bombay Hindi-Urdu 


If the characteristic features mentioned above of the languages 
generally labelled as pidgins are applied to Bombay Hindi-Urdu, it is 
evident that pidginization of Hindi-Urdu has taken place. However, 
this pidginization seems to occur only at level II speech. It 
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is also obvious that certain linguistic and sociocultural features of 
this pidginization are different from those of the pidginization 
resulting from colonialization. 


It is evident to anyone familiar with the linguistic picture of 
South Asia that there is a high degree of structural similarity bet- 
ween the modern Indo-Aryan languages in this area. In the case of 
Bombay Hindi-Urdu, the predominant sub-stratum structure is that 
of Marathi, a language which is structurally quite colse to Hindi. 
It is possible to argue that similarity of structure is the major cause of 
the carry-over of Marathi patterns into Hindi-Urdu since speakers 
of Bombay Hindi-Urdu believe that such patterns would be accepted 
as Hindi-Urdu patterns. The question of whether there any Dravi- 
dian sub-stratum structural features in this speech can only be 
answered after collecting and analyzing considerably more linguistic 
data. However, the fact remains that Bombay Hindi-Urdu has 
definitely been influenced more by Marathi than by any other 
language. 

The degree of similarity mentioned above not only exists at 
the structural level, but also in vocabulary. There is a significant 
percentage of vocabulary common between Hindi-Urdu and Marathi, 
or for that matter, between Hindi-Urdu and other Indian languages, 
especially the kind of vocabulary that is used by speakers of this 
level II speech. As a result, non-Hindi-Urdu speakers, especially 
Marathi speakers, feel comfortable in using Marathi words in their 
Hindi-Urdu speech since it is thought that any Marathi word can 
also be a Hindi-Urdu word. 


Another problem that is not easy to determine is: What is the 
ratio of reduction of Hindi structure to that of the carry-over of 
Marathi structure. Depending on the nature of this ratio, it is 
possible to argue in either direction, i.e., that Bombay 
Hindi-Urdu is pidginized Hindi-Urdu, or simply Hindi-Urdu 
greatly influenced by Marathi. The comparison of Bombay 
Hindi-Urdu to Calcutta Hindustani described by Chatterjee (1931) 
does not clarify the situation because his description shows that 
Calcutta Hindustani also has these two characteristics, namely 
reduction of structure and carry-over of Bengali structure, and it 
would be difficult to determine which of them is more predominant. 


It should also be noted that the reduction of structure is 
not unique to Bombay Hindi-Urdu. The same phenomena seem to 
appear in the speech of educated native speakers in the “Hindi” 
region as has been recetly pointed out (Harris 1968), If the kind of 
reduction of structure which occurs in Bombay Hindi-Urdu and 
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Calcutta Hindustani also seems to occur in the speech of native 
Hindi-Urdu speakers, can such simplification be then considered as 
a marker of pidginization? A related problem is that of determining 
what is ‘“‘standard’’ Hindi-Urdu. In the vast Hindi-Urdu-speaking 
region there are numerous degrees of speech variations, based not 
only on geographical location, but also on age, sex, caste, occuption 
etc. There are further differences between formal and colloquial 
types of Hindi-Urdu. It also appears that the kind of Hindi-Urdu 
described in the traditional grammars of Indian scholars is spoken 
only by very highly educated speakers in formal situations. 


It was mentioned earlier in connection with the vocabulary of 
Bombay Hindi-Urdu that native Marathi speakers who speak level 
If Hindi-Urdu use Marathi words with retroflex nasals and laterals 
as Hindi-Urdu words with the change of these retroflx consonants 
to dentals. It is difficult however, to determine if this is a conscious 
phenomenon, or whether the phonemic distinction between retroflex 
and dental nasals n/n and retrofiex and dental laterals 1/1 simply does 
not exist in the speech of those Marathi speakers who speak this 
type of Hindi-Urdu. There is reason to believe that the latter may 
be the case in many instances because of the available linguistic. 
evidence which suggests that these phonemic distinctions exist 
primarily in the speech of educated Brahmin Marathi speakers 
(Apte 1962). More linguistic data are necessary to answer this 
question. For example, if the mative speakers of any of the 
Dravidian languages who fall into the category of people who speak 
level If Hindi-Urdu also change these retroflex consonant to the 
dentals in words from their mother-tongue in order to use them in 
Bombay Hindi-Urdu, then the phenomenon has to be considered as 
conscious. 


Another question which can be raised while discussing the 
linguistic featuresof Bombay Hindi-Urdu is: What degree of the 
reduction of structure is necessary before this Hindi-Urdu speech 
can be labelled as pidginized. In other words, are there any set 
standards about the degree of simplicity necessary to label a given 
variety of a language as pidginized, or is it purely arbitrary ? 


The dominant language involved in the contact situation 
which has given rise to Bombay Hindi-Urdu speech is not a_ colonial 
language, although it is not a lingua franca. Bombay Hindi-Urdu thus 
differs froma large number of pidgin languages which appear to 
have developed as a result of contact between the speakers of one 
of the European languages such as English, French, Spanish, or 
German and a non-Western language. Another difference is that in 
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di-Urdu 


the case of Bombay Hindi-Urdu, the native speakers of we 
he non- 


do not seem to have a_ higher socio-economic status than t 
Hindi-Urdu speakers. 


Process or Stage? 


The need to make a distinction between the process of 
pidginization and the end result of such a process, namely a stable 
pidgin language has been recognized (Whinnom 1971, Hymes 1971). 
Both presuppose certain factors such as contact of two or more 
distinct speech communities, the need and desire for communication, 
and the lack of formal training in one or the other language, but 
especially in the dominant language. 


All these factors seem to be present in the situation under 
discussion here. The setting of Bombay city is clearly multilingual 
in natures! and the need to communicate is most urgent. Secondly 
the segment of the population which has the greatest need for such 
communication is also unable to learn Hindi-Urdu formally. 


As for the selection of Hindi-Urdu as the target language to 
be learned, there is clear evidence of the over-riding socio-political 
factors in favour of Hindi-Urdu. First, itis spoken by the largest 
segment of the Indian population. Second, some form of Hindi-Urdu 
has been contsantly recommended by the influential political leaders 
as the national language and/or lingua franca both before and after 
independence. The propagation of Hindi-Urdu or some form of it 
by such leaders as Gandhi and Nehru who were popular with the 
masses has had its influence. Third, Marathi, although spoken by 
the largest minority in Bombay had never enjoyed the status of a 
lingua franca until the creation of the linguistic state of Maharashtra 
in 1961, primarily because it has not been the language of trade and 
commerce inthe city.5 The great influx of Punjabi and Sindhi 
speakers to Bombay after partition, the dominance of Gujarati and 
Marwari speakersin trade and commerce, and the presence ofa 
significant portion of Telugu and Kannada speakers, especially in the 
small hotel business, created the situation where Hindi-Urdu was 
favored. There is reason to believe that for those who were outside 
the realms of bureaucracy and white-collar jobs in the city, but who 
needed to communicate across linguistic barriers, some form of 
Hindi-Urdu was the language to be used. 


However, it is difficult to precisely pinpoint the Stage at which 
the pidginization process in this setting is. If it is a process, we have 
to assume a continuum, and only the most exhaustive analysis of 
extensive material will reveal if the Bombay Hindi-Urdu “speech 
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described here can be named with any such labels as convergence, 
pre-pidgin continuum (Hymes 1971:68), salient pidginization, or 
substantive pidginization (Samarin 1971:119), If Bombay Hindi- 
Urdu is to be considered a stable end result of pidginization, we 
need to know the extent of its spread, the time-span of its existence, 
and perpetuation of its fixed structure in a consistent manner. In 
other words, we need to know the number of people who speak the 
kind of Bombay Hindi-Urdu described here and the extent of 
inter-group use; whether idiosyncratic variations in addition to the 
structural feautures described above occur among the Speakers; how 
extensively and for what additional functions this speech variety is 
used outside the occupational domain; whether there are any 
persons who only speak this speech variety and no other; and 
finally how far back can we trace its existence. It is apparent that 
Bombay Hindi-Urdu has been in existence for more than forty or 
fifty years as indicated by referencs to it by Chatterjee in 1931. 
However, it is not possible to answer all these questions in view of 
my limited data. Itis hoped that other scholars interested in 
Bombay Hindi-Urdu, and other similar versions in other urban 
centers of India will be able to provide answers to these questions. 


Even with answers to all the above questions, it may not be 
easy to determine if we are dealing witha process or with an end 
result. If it isa process, whether or not it is pidginization isa 
complex question to be answered. ‘‘What is salient, what is sub- 
stantive, in the standard definition of the properties of pidginization 
is far from resolved’’ (Hymes 1961:71). 


Classification and labelling 


Finally, we need to consider the genetic classification of 
Bombay Hindi-Urdu and to determine what label would be appro- 
priate for it 


Although genetic classification of many pidgin and creole 
languages has been difficult because of the totally different linguistic 
affiliation of the two base languages from which each pidgin or creole 
has derived, the problem does not exist in this case. Since both 
Hindi-Urdu and Marathi of which the Bombay Hindi-Urdu speech 
appears to be an admixture are Indo-Aryan languages, it is clear 
that it too should be classified as Indo-Aryan To what extent such 
classification is affected by the claim that the historical origin of 
Marathi itself indicates prior pidginization (Southworth 1971) is too 
marginal a question perhaps to be considered at this stage. 


What descriptive label should be used for this speech will 
depend on the answers to the questions raised earlier about Bombay 
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Hindi-Urdu being a process or an end result, and also about its 
being a dialect of Hindi-Urdu. The situation appears similar in 
some ways to that of Hawaiian English. The issue of how Hawaiian 
English should be designated has _ been controversial (Tsuzaki 
1971:328). The basic question raised in that case is whether it should 
be designated a pidgin, a creole, or a dialect of English, and although 
all three views are proposed, none is backed by conclusive evidence. 
Another view presented in this connection has been to treat Hawaiian 
English as a set of coexistant systems, which means that it is partly 
a pidgin, partly a creole, and partly a dialect (Tsuzaki 1971:329). 


In the case of Bombay Hindi-Urdu, labelling it a creole is 
simply not feasible at this stage because of lack of answers to many 
questions raised earlier. As for treating Bombay Hindi-Urdu asa 
dialect of Hindi, this too cannot be justified because of Jack of 
evidence about Bombay Hindi-Hrdu being the sole native language 
of at least some speakers. It is my belief that most speakers of 
Bombay Hindi-Urdu speak it as a second language and that too 
primarily when they interact with individuals of different linguistic 
and ethnic backgrounds in occupational, or business settings. 


Another question which needs to be answered, is this: Do 
native Hindi-Urdu speakers use Bombay Hindi-Urdu speech variety 
for communicating with native speakers or Marathi, Gujarati, Sindhi, 
and other Indian languages? If the answer to this question is in the 
affirmative, it makes a strong case for calling this speech pidginized 
Hindi-Urdu. Similar studies of Hindi-Urdu spoken in other metro- 
politan areas in different regions of India will certainly help usin 
determining if the features found in Bombay Hindi-Urdu and 
Calcutta Hindustani are shared by other speech varieties in which 
case it is possible to talk of representative characteristics of all-India 
pidginized Hindi-Urdu. It is already obvious that some characteristics 
of Bombay Hindi-Urdu are common to Hindi-Urdu spoken in a 


multilingual village in the Sangli district of Maharashtra (Gumperz 
and Wilson 1971). 


In conclusion, two labels seem appropriate, If Bombay Hindi- 
Urdu is to be called a process, then it should be labelled Bombay 
Pidginized Hindi Urdu. If, on the other hand, it is to be considered 
an end product, then following the general practice of labelling many 
pidgin and creole languages in which the name of the area where 
such a language is combined with the name of the upper source 


language (Southworth 1971:269), it should be called B ‘doi 
Hindi-Urdu. ombay Pidgin, 
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NOTES 


1. An earlier version of this paper was read atthe twenty-first 
annual meeting of the Association for Asian Studies held in 
Boston, March 28-30, 1969. 

2. The term Hindi-Urdu is used throughout this paper to cover 
the wide range of speech varieties between the two polar versions 
represented by a highly Sanskritized formal Hindi speech and a 
highly Persianized formal Urdu speech. Despite the claims of 
the politicians and others, it is a fact the majority of speakers 
who claim to speak either Hindi or Urdu use instead a mixture 
of these two languages. The major difference between these 
two languages is at the phonological and lexical levels and 
appears most noticeably at the written level as two different 
scripts are used. The ability of an average speaker to switch 
codes to suit the appropriate contexts also makes the whole 
situation more complex. Therefore from the linguistic point of 
view, it is realistic to use the label Hindi-Urdu. Usage of the 
term Hindi or Urdu in quotation marks in this paper reflects the 
official terminology of census reports and of others who takea 


position different from the one stated here. 

3. Starting with the linguistic survey of G. A. Grierson, numerous 
accounts of the linguistic diversity in India have been presented 
by many scholars. Among the more recent attempts are those 
of Emeneau (1956), Ferguson and Gumperz (1960), and Poddar 
(1969). 

4, The approximate percentage of speakers of major Indian langu- 
ages in Greater Bombay in 1961 was as follows: Marathi 42.5; 
Gujarati 18.1; Urdu 9.6; Konkani 4.6; Tamil 2.5; Telugu 2.3; 
Kannada 1.9; Malayalam 1.5; and Punjabi 1.3. Thus the 
native speakers of major Indian languages in Greater Bombay 
excluding native Marathi speakers, is almost fifty percent. This 
is a clear indication of the multilingual nature of Bombay city 
(Census of India 1961 Vol. X, Part x (1-B) Greater Bombay 
Census Tables: 185-192). 


5. The 1961 census data on bilingualism for Greater Bombay 
(cf. source listed in note four above) indicate that from among 
the bilingual speakers of major Indian languages, only speakers 
of Telugu have the highest percentage of bilingualism in Marathi 
thanin any other language. As for the rest, Hindi speakers 
prefer Marathi as the second choice after English; Gujarati, 
Kannada, Tamil, and Urdu speakers prefer Marathi as the 
third choice after English and Hindi; for Sindhi and Punjabi 
speakers Marathi is the fourth choice. On the other hand 
bilinguals from among the speakers of all major Indian 
languages prefer English and Hindi as the first two choices 


over all other languages, 
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INDO-ARYAN AND TIBETO-BURMAN 
CONTACT 


AS SEEN THROUGH NEPALI AND NEWARI VERB TENSES 


Edward H. Bendix 


1. Introduction! 


Nepali refers to the official language of Nepal and to the 
other Eastern Pahari dialects spoken through Nepal and eastward. 
It is a North Indian language, belonging to the Indo-Aryan branch 
of Indo-European. The Newari language isa Tibeto-Burman group 
of substantially similar dialects spoken primarily in the Kathmandu 
Valley, but also in towns and villages elsewhere in Nepal. Writing 
and publication in both languages uses the Devanagari syllabary. 
The written, taught, formal style of Nepali is almost standardized, 
is in part considered by some to be artificial or pundit speech, and 
differs more or less sharply from native speakers’ colloquial depend- 
ing on education, caste-clan group, and geographical region. Modern 
written or literary Newari is to a large extent a creation of this 
century and is basically derived from a writer's spoken language 
although standardization has set in. Prior to this an archaic. 
classical Newari was used for religious literary texts. It is a prestige 
dialect of Kathmandu, on which writers are basing their standardiz- 
ation, that is described in this paper. 


The comparison to be made here between the two languages 
does not imply that their similarities are due only to their contact 
with one another. For one thing, Nepali has been in centuries-long 
contact with other Tibeto-Burman languages during its eastward 
spread through the Himalayas and is a second language for large 
numbers of Tibeto-Burman speakers. For another, Newari speakers 
repeatedly absorbed immigrant groups from North India that 
originally spoke Indo-Aryan languages. In addition both languages 
share areal features which are doubtless of greater antiquity than 
the present millenium or more and are of broader geographical 
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extent than North India and the Himalayas. There are thus many 
similarities which cannot be unambiguously identified as the product 
solely of Nepali-Newari contact, and even many of those that might 
appear to be so identifiable would require description of neighboring 
languages for substantiation. 


Out of the mass of morphological, syntactic, and other 
parallels, as well as of direct mutual borrowing of forms, evidenced 
between the two languages, this paper will restrict itself to the less 
obvious congruences in the verb and related syntactic behavior. More 
importantly, it will try to show how the semantic study of language 
contact reveals underlying similarities in grammar that differences in 
form would tend to obscure. In any event, Nepali and Newari do 
not exhibit the spectacular relationship, reportedly found elsewhere 
in South Asia, of being virtually one code with two expression systems. 


First, those tense forms that will be discussed are introduced 
and briefly described for the two languages. Next, various semantic 
and other categories will be proposed and exemplified in a characteriz- 
ation of the Newari tense system. After that, the Nepali system 
will be presented in comparison with the Newari one including 
discussion of varieties of Nepali and some peculiarities of the Nepali 
ergative construction illuminated by a Tibeto-Burman model. 


Many of the finer details and considerations have been omitted 
in the discussion of the Newari and Nepali tense systems, and the 
reader should not look for more than the broad outlines. A more 
comprehensive study of the Newari system is in preparation. The 
label ‘‘tense’’ is used only as a very traditional cover term for what 
will be described. Time is only one element of “‘tense’’ as used here, 
and often a minor one. 


2 The Verb Tense Forms to Be Compared? 
2.1. The Nepali Forms 


Where there are person-number distinctions in Nepali verbs, 
only third personsingular forms will be used at this point to 
exemplify categories in Nepali. Other persons are discussed under 
evidentials and the ergative below. Items are cited in their shape 
and use in a formal style, and differing spoken usages are give where 
pertinent later. The verb tenses to be considered here will first be 
exemplified with the stem gar- ‘do’ and then identified. The glosses 
given here and elsewhere are, of course, primarily for translation 
identification, their more precise semantics being a subject of this 
paper. Translation is also resorted to in order to help the reader of 
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English map the Newari and Nepali systems into a familiar grid, in 
place of an etic grid of semantic description, which linguistic science 


has only begun to develop. 


(1) Perfective (a) gar-yo ‘did’ 
(b) gar-eko ‘done’ 
(c) gar-eko cha ‘has done’ 
(d) ga-eko thiyo ‘had done, did’ 
(2) Imperfective (a) gar-ne ‘doing’ 
(b) gar-cha ‘does, will do’ 
(c) gar-thyo ‘used to do, would have 
done’ 
(d) gar-la ‘will/might do’ 


(1) Perfective. (a) -y- the finite past (perfective) tense 
suffix + -o third person singular suffix after -y-. (b) -e perfective 
participle suffix + -ko the broad genitive/adjectivalizing/nominalizing 
suffix. (c) -eko, see (b), + ch- present stative stem of existential- 
locative ‘to be’ + -a third person singular suffix after ch-. (d) -eko, 
see (b), + th (i)- past stative stem, corresponding to ch-, + —yo, 
see (a)- 

(2) Imperfective. (a) -ne imperfective participle suffix. 

(b) -cha, see (Ic). (c) -thyo bound form of thiyo (id). 

(d) -Ja future suffix, with third person singular marked by 

absence of a personal suffix. 


2.2. The Newari Forms 


The Newari verb endings to be considered here are, exemplified 
with the stem yar— ‘do’, as follows: 


(a) yat-o ‘did’ 

(b) ya-v’ ‘does/do, used to do, did’ 

(¢) van-a ‘(I/we) do, used to do, did’ 

(d) ya-i ‘will/might do’ 

(€) ya-e “(I/we) wiil do’ 

(f) ya.v'-gu, yan-a-gu, ya-i-gu, ya-e-gu ‘having done, doing, 

going to do etc.’ 

The Newari verb endings are thus -o, -y', -a, -i, and == 
(These letter symbols will be used without hyphen in the next section 
to identify the tenses in the semantic formulas.) These tenses do not 
distinguish number nor, as explained below, person. The adjecti- 
valizing / nominalizing suffix -gu is added to -o, —a, -i, and -e, but 
not to -v’. Thus, yat-o + -gu yields ya. v'-gu ‘having done’ (i e 
the latter is not from *ya-v’ + -gu). A kind of perfect is formed 
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with du, the locative-existential ‘be’: ya. v'—gu du ‘has / had done’, 
yan-a-gu du ‘(I/we) have had done’. 


3. Semantic and Other Features of the Newari Tense Endings 


3.1. Introduction 


The definitions of the ‘tenses’? will be given more in the 
form of descriptions. A stricter statement in logical form is still in 
the process of being worked out, especially since it also requires the 
preliminary formulation of a descriptive theory including, for 
example, presuppositions, illocutionaries, and felicity conditions. 
The present paper is not the proper context for defending the 
framework of description adopted. The approach to definition 
at this point in the analysis will be that of seeking a core meaning 
for each tense. That is, the tense form is taken as the starting point, 
and, where possible, a single definition will be attempted. The 
definition gives the components that can be factored out of the range 
of usages of the given form and that oppose it to the other forms. 
Then some of these various usages will be described. The ultimate 
goal would be an autonomous statement of the semantic system of 
tenses, linked by rules to the formal expression system. On the: 
evidence, some of the usages to be described seem to be good candi- 
dates for becoming separate categories of the semantic system. 


For the description, the agentive/ergative ‘‘subject’’ of 
transitive verbs will not be distinguished from the subject of intran- 
sitives Since transitivity does not affect the interaction of subject 
with verb endings. A detailed discussion would have to cover the 
semantic interactions of subject and object with one another and 
with the verb and tense meanings. The question of subject, object, 
etc. as categories only of surface grammar will be glossed over for 
simplicity. Thus, x will be used as a variable corresponding to 
subject noun phrase (referent), and F will represent the state or 
action of the main verb phrase without the verb tense endings. That 
is, F represents an action that x performs or undergoes or a state 
that x is in. 

The proposition F (x), with predicate F and argument x, is 
itself the argument of complex higher predicates, and the tenses 
expressed by -v’,-0,-a, etc. are among them. The phonological 
letter symbol of each tense ending will be used, without hyphen, to 
represent the particular complex of (higher) predicates that this tense 
incorporates. For example, wo-n'ya-v' she/he does/used to do (it)’ 
(wo third person singular pronoun, -n’ agentive suffix, yat- ‘do’, 
-y’ characterizing tense suffix) is one realization of v’ (F (x) ), where 
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x is wo and Fis yat-. These notational conventions are adopted to 
simplify the definitions and descriptions of the tenses. 


We start with the evidential categories in the Newari system. 
Given a statement, ina definition, that the speaker s asserts that 
F(x) (or some other proposition), this statement should be under- 
stood to have the following felicity condition. The condition is that 
s know that F/x) and thus that s have had one or more experiences 
that have caused s to know that F(x). In describing the evidential 
components of -o, -v’, -a, -i, and -e, four subtypes of experience 
are specially singled out: (a) s observes at least part of the action 
or state F(x) to occur or exist; (b) s observes circumstantial evidence 
that F(x); (c) s is told that F(x); (d) s is x and intentionally per- 
forms the action F(x). 


To account for the location of S(x) in relation to the time/event 
of speaking at which s asserts, reports, questions, etc. F(x) is pre- 
supposed to carry with it a time variable Time F(x) at which it is or 
can be located. This time variable derives from a variable time 
located. (This time variable derives from a variable time predicate 
taking F(x) aSits argument.) Since the tenses include aspect and 
other characteristics, they have their own time variables, Time o, 
Time v', Time a, etc. These are not necessarily coterminous with 
Time F(x) for the propositions o (f(x)), v’ (F(x)), a (F(x)), ete. 


Each tense definition will be introduced by translations of its 
more neutral usages, i. e. usages elicited with no more than minimal 
contextual cues, whether verbal or nonverbal. 


3.2. o (F(x)) 

For o (F(x)) translations might be ‘x did F’ or ‘x fell to doing 
F’ where F is an action performed by x. States can be translated ‘x 
became F’, *x turned out to be F’, ‘x has become/turned out to be 
F’, ‘xis F’, (This last, ‘x is F’, is discussed in 4.7 below ina com- 
parison with Nepali -yo.) Thus, a part ef a core meaning definition 
we would say that an action or state F(x) comes to be at least once 
For a given occurrence o (F(x)), i. e. in an utterance, Time F(x) ane 
Time o are classes of times (or occasions) with at least one member 
each. This isas in English, where ‘x did F’ or ‘x became F’ for 
instance, do not specify how many times the event took place. 


To describe o (F(x)) somewhat more formally, we Say that the 
speaker s asserts that Fix) comes to be at least once, Or in other words 
s asserts that become (F(x)) at Timeo. The conditions are that 
Time o is before the time of speaking and that the evidence of 5 for 
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making the assertion is at least one experience of s of observing at 
least part of F(x). 


Some comments on this are in order: First, the assertion 
become (F(x) ) would include the assertion of F (x), where Time 
F (x) is after Time o without any intervening time. Next, the above 
definition of o (F (x) ) is a unitary one devised to match a single tense 
form. That is, it assumes that there is one category in the semantic 
system matching one unit in the formal system rather than severa] 
semantic categories matching several homenymous formal units 
(see 3.1 above). Then, although the time of speaking follow Time o, 
it may or may not be included in Time F (x) or in the time of a 
resultant state or action, and one regular use of o (F (x) ) includes 
present-time reference. 


3.3. The Evidential Felicity Condition and the Inchoative 


The evidential felicity condition, it must be stressed, oniy says 
that at least one of the experiences of s must be one of observation; 
it does not imply that the experience of observation is the only one 
that s must have. For example, o (F(x)) may be used where a 
proper translation might be ‘x became F’ and s did not observe 
become (F (x) ) but only F (x). Here s may have known by observa- 
tion or otherwise that x was not F, i.e. not (F (x) ), before the 
Time o of become (F (x) ). However, there are situations to which 
o (F (x) ) is applied where the translation ‘x (has) turned out to be 
F’ or ‘x is / was F’ is appropriate, for example trying on a shoe and 
finding that itis too small. Here s may reasonably assume that 
F (x), rather than not (F (x) ), held before Time o and continued to 
hold through Time o and the Time F (x) actually referred to, i.e. 
the shoe did not suddenly shrink upon being tried on Obviously, 
the import of become (F (x) ) must be worked out more carefully as 
a component of o (F (x) ). 


To do this, a three-valued truth dimension can be set up for 
Newari assertions (it could also be seen as two linked binary 
dimensions): (a) F (x) is established by evidence, (b) mot (F (x) ) 
is established by evidence, (c) the status of F(x) is not (yet) 
established by evidence. Then became (F(x) might be better 
rendered as ‘F (x) comes to be established by evidence’ or ‘F (x) 
comes to be true or proven’. Thus an assertion become (F (x) ) (1.€., 
o (F ’x)) requires no necessary presupposition that not (F (x)) just 
before Time o, nor a condition that s know any such thing, and none 
has been included under the conditions in the definition, Since the 
inchoative label “‘become (F (x))’’ has been used with such a presu- 
pposition in describing other languages, it might, for the sake of the 
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metalanguage, have been better to pick some other label for this 
component in Newari. 


3.4. vw’ (F (x)) 

In the case of y’ (F (x)), where Fis an action performed by x, 
translations might be ‘x does F’ or ‘x used to do F’, and where Fis a 
state, ‘x is F°, “x becomes F’, ‘x used to be F°, ‘x used to become F. 
vy’ (F (x) ) can also refer to a single past-time event, as well as repeated 
past-time events, and overlaps in such cases with e@ (F (x) )- However, 
it does not require direct observation as an evidential condition, which 
o (F (x ) does require. 


A definition of »’ (F (x) ) might be as follows. The speakers 
depicts x as characterized by F(x). To put it differently, s says 
(not “‘asserts’’) that x is such that F(x). This formulation depends 
on the fact that in a question vy’ (F (x)) is not ordinarily answered 
by kho-y’ ‘yes/ it is so/it is true’ or mo-Khu ‘no/ etc.” (from khet- 
“be / become so’), unlike the other tenses. As a question, rv’ (F (x)) 
is ordinarily answered by a repetition of the verb of the question, 
in the same tense. in the affirmative or negative. 


The conditions are as follows. F(x) occurs at least once. 
(Note that this falls under conditions in contrast to the case of 
o (F (x)), where itis asserted.) The Time vy’, during which F (x) 
characterizes x, begins before the time of speaking, i. e. Time ry’ may 
end before the speech event begins, or it may include the latter. 
Time v' includes all Time F(x) referred to by the given utterance of 
v’ (F(x)). The evidence of s is at least one experience of observing 
that F(x) and/or of observing circumstantial evidence that Fo) 
and/or of being told that F(x). This disjunction can be removed 
by a formulation which says that there is at least one experience of s 
which is other than the intentional performance by s himself of the 
F (x) referred to by the utterance, intentional performance being the 


fourth type of evidence mentioned in 3.1 above for making a 
statement. 


Some clarification is needed for this formulation of the 
experience required in evidence. Evidential experience is presented 
for the Newari tenses as a dimension with four values (a) observation 
(b) hearsay, (c) circumstantial evidence, (d) intention. For »’ (F x) ) 
a@ minimnm of one experience from (a), (b), or (Cc), a disjunction 
has been rephrased as a unitary statement of a minimum of Guts 
experience from non- (d). The reason is more than a theoretical 


demand for the economy of unitary definition To : 
; a . 0 hel ; 
a (F (x) ) is described next. P explain this, 
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3.5. a(F(x)) 


In the simple declarative the subject x of a(F (x)) is first 
person, and translations might be ‘I (or ‘we’) did F’, ‘I used to do 
F’, Thus, the speaker s asserts that F (x) occurs at least once, and 
the conditions are: s = x; F(x) isan action: x performs this F (x) 
intentionally; Time a of this intention includes all Time F (x); at 
least one such Time F(x) occurs before the time of speaking ; 
the evidence for the assertion is at least one experience of s which is 
this intentional performance of F (x) (i.e., F(s)). It could be 
argued that only s can claim the internal experience of intentional 
F (x) and that therefore s = x follows naturally and is superfluously 
stated above as 4 condition. In any event, it should be noted that 
S = X Is given as a condition rather than as part of the assertion since 
~a is evidential and not a first-person verb ending: it may occur 
With any person such as, for example, second person in the interro- 
gative and third person in the quotative. A fuller statement would 
thus account for how speaker s can be second or third person. 


Since F (x) must be an intentional action here, states, as well 
as actions defined as involuntary (i. e., where x undergoes F), occur 
with v’ (F (x)) or o(F (x)) even when the subject is first person. © 
Actions defined as unmarked for either intentional or involuntary 
occur freely with any evidential category for first person, information 
about intention being supplied by the category of the tense. 


Actions defined as intentional when in the simple declarative 
with first person subject occur expectedly then with a(F(x)). When these 
conditions are met but v’ (F(x)) or o(F(x)), occurs instead of a(F(x)), 
the speaker s indicates hearsay or observation as evidence for his report 
of his own actions. This has severa] uses. Thus, s may not remember 
doing F and has found out about it in some other way. Ors splits 
himself into two personae, speaking from the point of view of 
someone else or reporting his actions in a dream. There isa general 
effect of talking about oneself in the third person, although the 
first-person pronoun is used. When s employs this usage even 
though he also has the evidential experience of his own intentional 
performance (i. e., even though a (F (x)) would not be infelicitous), 
there is an impression of flippancy, annoyance, distance, or failure 
to accept responsibility for one’s actions in the face of one’s inter- 
locutor, an impolite usage towards one’s seniors. This then is one 
more justification for a unitary statement of the evidential condition 
for v’ (F(x) ) as atleast one experience of nonintention, since that 
is what s communicates when speaking in the first person with 
y' (F (x)). The additional evidence of an intentional experience is 
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not excluded as a possibility by this evidential condition for y'(F(x)): 
it is simply not mentioned. 


3.6. e(F(x)) and i (F (x) ) 

These two tense forms will not be described in as much detail 
as the previous three. Typical translations of e (F(x) ) are in the 
first person: ‘I (or ‘we’) will/shall do F’, ‘I would do F’, Thus, 
e(F(x)) is in the same evidential category with a(F(x)). It 
involves actions as opposed to states; the actions are defined as 
intentional or unmarked as opposed to those defined as unintentional; 
s = x; F(x) will occur at least once. In contrast to a(F (x)), 
e(F(x)) has adifferent time reference: Time F(x) is future, but 
the evidence of s for making the prediction (assertion?) is present 
intention of s. 

The use of i(F(x)) in the first person declarative when 
e (F (x) ) would also seem appropriate gives similar effects to those 
described in the preceding section. i(F(x)) falls in the same 
evidential category with o (F(x)), and possibly v’ (F (x)), in that 
the evidence of the speaker s that is required asa condition is other 
than intention of s. Time reference is in some sense future. F (x) 
will occur at least once. 


There are more reasons than for o(F(x) to believe that i(F(x)) 
is best considered not one form, but several homonymous forms. 
These reasons will not be gone into here in detail. They consist of 
such things as differing ways of forming the negative, selective com- 
bination with -gu, and informant reactions of actual ambiguity 
The last, ambiguity, is distinguished from simple generality or 
vagueness of reference, Like ambiguity, generality must often be 
rendered by more than one translation, but, unlike ambiguity, such 
multiple renderings derive from greater fineness of the target language 
on the meaning in question compared to the source language. In 
any event, the following breakdown of i (F(x) into several homony- 
mous forms is tentative. 


(a) One sense expressed by i(F(x)) can be translated ‘x will 
(do/be/become) F’ or ‘x is to/was to F’. As the independent clause 
of a conditional construction, it may be rendered ‘x would F’. (b) 
Another sense translates at ‘x may/might F’. (c) A kind of present- 
time usage of i(F(x)) can be translated by a similar use of the English 
future with will to express a repeated or habitual occurrence, e. g. ‘on 
a warm summer’s night he wil] often drop by after Supper’. A core 
meaning can be offered for (a), (b), and (c) that involves and evidential 
in the form of a conditional statement : ‘if y observes x at time j 
will observe F(x)’. = 
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3.7. The Tense Forms with ‘gu’ 


The suffix —gu is added to the tense endings -o, -a, -i, and -e, 
but not to -v’. Thus, wo-n’ ya. v/-gu ‘he (she) having done (it)’ 
corresponds to wo-n’ yat-o ‘he did (it) (wo third person singular 
pronoun, -n’ agentive suffix, yat- ‘do’); ji-n’ yan-a-gu ‘I doing / 
having done’ to ji-n’ yan-a ‘I do/did’ (ji first person singular) ; 
wo-n' ya-i-gu ‘he doing’ to wo-n' ya-i ‘he will do’; ji-n’ ya e-gu 
‘I doing’ to ji-n’ ya-e ‘I will do’. The-gu form adjectivalizes a verb 
phrase, rendering the equivalent of a relative clause: ji-n’ yan-a-gu jya 
‘(the) work that I do/did’ from ji-n’ jya yan-a ‘I do/did (the) work’. 
This can further be nominalized to ji-n’ yan-a-gu ‘what I do/did; 
the fact of my doing (it)’. Similarly, wo-n' ya. vy’. gu ‘what he did; 
the fact of his having done (it)’. Relativized usage such as ‘the work 
that he did’ or ‘what he did’ does not assert in itself that he did it 
but rather presupposes the truth of his havig done it, and this holds 
for the —gu forms. 


Furthermore, -gu clauses regularly occur as complete sentences 
and do not become assertions of the action itself thereby, but remain 
presuppositions, Take, for example, the following conversational 
sequence, A: wo-n' yat-o ‘he did (it). B: gu—boioe ya. v'-gu? — 
when did (he) do (it)?’. A: mhigo ya. v'-gu ‘ (he) did (it) yesterday’. 
The action having been asserted once in the first statement, the iast 
statement need not reassert it. Knowledge already shared by the 
interlocutors is also stated ina-gu sentence. The above question, 
gu-bolve ya. v'-gu, presupposes that he did it and asks when. Such 
questions can also occur without -gu, e.g. gu-boloe yat-o? They are 
then both assertions and questions: ‘he did it; now, when did he do 
it?’. Such a form is understandably avoided when speaking to 
persons that one owes respect. 


Although -v’ does not have a separate -gu form, the function 
of the latter is served in other ways. When-v’ has the same time 
and event reference as -o, it corresponds to the .v/-gu (from -o-+ : gu) 
of -o. This -gu neutralizes the evidential condition of direct obser- 
vation required for -o, a condition which -y’ doesnot have. When 
-y’ refer to the habitual (or interative), it corresponds to the -i-gu 
of - i in the latters reference to habitual action. 


Form with -gu also form a perfect with du, the -v’ form of 
dot- existential ‘be’: ji-n' yan-a-gudu ‘I have/had done (it)’, 
wo-n’ ya. v'-gu du ‘he has/had done (it)’. Although won’ yat-o 
the did (it)’, as an -o form, can also cover reference to the result of 
the action as ‘he has done (it)’, the -gu du form asserts the result 
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as well. Thus wo wol -o ‘he came/has come/had come’, but wo wo. 
vy’ -gu du ‘he came/etc. and is/was here’. 


4. A Comparison of the Nepali and Newari Tense Systems 


The formal or official style of Nepali is presented unless 
specified otherwise. For grammatical characterization of the Nepali 
tense forms, refer to 2.1 above. For fuller description of the semantic 
categories, see under the appropriate Newari form in the preceding 
section on the meanings of Newari tense endings. 


4.1. Time and Aspect Categories 


A chart at the end of this subsection summarizes comparison 
with aspect in broad outline. 


The Nepali -yo form is equivalent to Newari -o in time 
reference and aspect. (Newari -a is compared separately below.) 
Thus, Nepali us-le gar-yo (us oblique of u third person singular 
pronoun, ~/e agentive suffix, gar- ‘do’) and Newari wo-n’ yat-o are 
both ‘he did/has done (it). Both are perfective in aspect and de- 
pending on the nature of the verb and the context, may be inchoative. 
That is, the action began and finished coming into being and may 
also be completed, or the state finished coming to be and may or 
may not still exist at present. This also allows for a kind of in- 
choative or inceptive of ongoing action, depending onthe proper 
context: ‘fell to doing/has fallen to doing/is doing’. 


In both languages, the inchoative of state may be either 
‘came/has come to be’ or ‘came/has come to be true, turned/has 
turned out to be so’. Thus, for example, Nepali Thik bha-yo (Thik 
‘O. K.’, bha- perfective stem ‘be’) and Newari thik jul-o (thik 
‘O. K.’, jul-the equational or class-membership ‘be’) can both 
translate as ‘(it) became/has become O.K. (not having been so 
before)’, i. e. an actual change of state, or ‘(it) turned/has turned 
out to be O. K. (and presumably was O. K. before but without my, 
her, etc. knowledge)’. Furthermore, for states, -yo and -o may 
translate not just ‘x has become F’, but simply ‘xis F’, In this 
latter sense, these perfectives are particularizing; ‘x is F’ on this 
occasion, as opposed to a characterizing tense in which ‘x is F’asa 
characteristic, regulariy, if one were to observe Xelc: 


The Newari -gu forms correspond to the Nepali -eko per- 
fective and -ne imperfective participial forms, i, e. -o + ~su 
(> v'-gu) to -eko, and -i-gu/-e-gu to -ne (see below for -a-gu). As 
described above for Newari, the Nepali -eko and -ne suffixes form rela- 
tivized (adjectivalized) and nominalized clauses. Such Clauses like-wise, 
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can also stand as independent sentences expressing presuppositions 
rather than assertions. The same social implication, given as an 
example for Newari, follows from their meaning: e. g., timi-le ke 
gar-eko? what have you done ?’ vs. timi-le ke gar-yaw? ‘I see you’ve 
done something; now, what was it?’. Basically, the -eko and -ne 
forms correspond to the -yo and -cha tenses respectively with regard 
to aspect. (For the imperfective -cha tense, see below.) -eko and 
-ne are quite common as the finite verb forms of independent 
sentences in the spoken language, and one bureaucratic style for 
interoffice memoranda is cast primarily in such sentences. 


We see here a case where differences of form obscure similari- 
ties in function. Nepali has two adjectivalizing/nominalizing suffixes 
to differentiate aspect, whereas Newari has one and expresses aspect 
in the usual verb endings preceding this suffix, (Actually, the two 
Nepali suffixes are each segmentable, e. g.-eko contains -e-asa 
separate marker of the perfective). 


Nepali -eko cha, as us-le gar-eko cha ‘he has done’, and -eko 
thiyo, as us-le gar-ako thiya ‘he had done/did’, make a distinction 
of time which the single Newari -gu du as wo-n’ ya.v'-gu du ‘he has/had 
done’, does not. The latter includes Nepali -eko cha and overlaps 
with -eko thiyo. In both languages, as described in the discussion on 
Newari, these forms assert not only the action but also some sort of 
state or situation as resulting from the action and continuing after it 
up to some reference point. This isin contrast to the simple _per- 
fectivity of Nepali -yo and Newari -o which may not express continu- 
ance of the resultant. 

The Newari -v’ and -i (and -e) and Nepali -cha, -thyo and 
-]4 overlap in most of their senses. The equivalents of ‘do’ will be 
used to exemplify the comparison. Newari ya-v’, when referring to 
past habitual (characterizing) ‘used to do’, corresponds to this sense 
of Nepali gar-thyo. In present habitual (characterizing) usage, ya-y' 
corresponds to Nepali gar-cha ‘does’. gar-cha as ‘does’ also covers 
Newari ya-i in the latter’s present habitual sense. (See 3.6 above for 
the uses of -i.) gar-cha as definite future, ‘will do’, renders this same 
sense of ya-i. The contingent future of Newari ya-i, ‘may do’ or 
‘(watch out: he) might do (it)’, corresponds to Nepali gar-/a. 
Newari ya-e ‘(I) will do’, asa statement of the speaker’s future 
intention, is rendered by the first-person form of Nepali gar-cha. 
Correspondences change slightly in the interrogative, where, for 
instance, request for permission may be expressed. 


Nepali also has a future form, gar-ne-cha ‘will do’, expressing 
a strong assertion or prediction of a future event in the spoken 
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language. In a modern, formal, often journalistic, written style, it 
is equivalent to the simple future of Western languages, particularly 


English. 


Newari -a (and -a-gu) expresses only nonfuture time with 
regard to tense, and the evidential category of intentional action, 
assertable only by the performer of the action. In time and aspect, 
therefore, it is more general and covers the range of the Newarl non- 
intentional -o (.y/-gu) and -v’ both. It has no equivalent in Nepali 
and, in the simple declarative, corresponds to the first-person forms 
of Nepali third-person -yo -cha, -thyo (and -eko and -ne). Likewise, 
the perfect -a-gu du in the declarative expresses the first-person froms 
of Nepali -eko cha and eko thiyo. 


Summary Chart 


Approximate Tense-Aspect Correspondences 


Newari Nepali Glosses with ‘do’ 
| —ne ‘doing’ 
-gu sae 
-eko ‘having done’ 
-eko cha ‘has done’ 
-gu du 23 
-eko thiyo ‘had done’ 
-0 ~yo ‘did’ 
-thyo ‘used to do’ 
_y/ ey 
‘does’ 
—————-| -cha...... a 
‘will do’ 
-i = Se 
la ‘may/might do’ 


—— ee 
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4.2. Evidential Categories and Personal Endings of Verbs 


As said, Newari endings may be put into two classes, -a -e 
for intentional associated with first person, and -v’ -o -j for non- 
intentional (i. e., unmarked for intention). As evidential categories, 
these forms may occur with any person however, depending on 
evidence, affirmative vs. interrogative, assertive vs. quotative, and 
other factors, and furthermore do not distinguish number. (In 
addition Newari and Nepali have a set of high-honorific verb forms 
which are not considered here but do not essentially pose any exce- 
ptions to the tense systems. ) 

In the formal or official style of Nepali, the —yo tense is neutral 
as to the speaker's evidence for the assertion, in contrast 
to the aspectually corresponding Newari -o of direct observation 
but there are speakers who would avoid _yo if their evidence is 
hearsay. Both languages add quotative particles, Nepali re and 
Newari hd, to verb forms to specify hearsay evidence. The Nepali 
first-person ending and Newari -a may both occur with these 
quotative particles with third person reference if the speaker’s source 
for the information is the performer of the action. Nepali has a 
further form, e. g. gar—e-cha, for actions or states found out about 
after the fact by later observation, hearsay, etc. 


In the formal style, Nepali, has the Indo-European system of 
verb-tense endings consisting of a differentiation of three persons and 
two numbers. The second person, however, is tied in with the scale 
of honorifics, and its singular represents the low grade of the 
honorific scale, where its ‘‘plural’’ is middle grade with singular vs. 
plural being marked in the pronoun’ with gar- ‘do’, for example, 
in the past (perfective) tense and for tenses using (-) cha, the six 


forms are: 


Sing. Plur. Sing. Prur. 
TP.) fae gar-yaw  chu/chi chaw 
2P.  gar-is gar-yaw —_chas chaw 
3P.  gar-yo _ gar-e cha chan 


In the various spoken styles and dialects, however, not all of 
these distinctions are found. For example, in the few, earlier 
grammars of Nepali, often based on the speech of hill peoples serving 
outside of Nepal in the Gurkha regiments, notes, may be found to 
the effect that in speaking only the 1P. sing. and 3P. sing. forms need 
be used, the 3P. sing. for all non-1P. This begins to give the appe- 
arance of the two main evidential categories of the Tibeto-Burman 
Newari: the intentional, associated with (although not restricted to) 
1P., and the nonintentional, associated with the other persons. 
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Because of social and regional variation, a further exam ple ofa 
Nepali colloquial will be given from the speech of a single individual, 
born in the Gurkha district, raised later in Kathmandu, educated, 
from a wealthy Chetri family. Primarily utilitarian in his approch 
to language, he describes the range from colloquial through formal 
speech to writing in terms of the rigor with which teachers in school 
corrected given items in students’ speech and writing. In his collo- 
quial speech as observed, and, he claims, that of his associates and 
family, asa rule the 1P. and 3P. sing. verb endings are used 
regardless of whether the subject is singular or plural. In the perfect 
tense -eko cha with transitive verbs, the 3P. sing. form cha tends to 
be used for all persons and numbers. (Note that transitives in 
perfective tenses are ergative, the ‘subject’? taking the agentive suffix 
-le.) This is parallel to the Newari perfect -gu du, where du is the 
invariable form. In fast speech, the 2P. ‘‘plur.”” ending of the -yo 
tense, namely -yaw [yow], tends to simplify phonologically 
and reduce to -yo [yo], and then both to reduce further to [yo]. 


In his variety of Nepali colloquial, then, there is no distinction 
of number in the verb endings, which only distinguish person (except 
ia the -eko cha tense). The 2P. “‘plur.’’ endings remain distinct 
from 2P, sing., but, as said above, this is really an honorific distin- 
ction and note one of plural ys. singular number. Also, the 3P. 
‘“‘plur.’’ verb ending of any tense may be used with a higher grade 
honorific function, without in fact distinguishing number semantically, 
with certain honorific pronoun (and noun) subject forms. Only these 
honorific pronoun forms will be marked for number. To use these 
honorific pronouns but replace the 3P. ‘‘plur.’’ verb endings with the 
corresponding 3P. ‘‘sing.’? ones may be found humorous and labeled 
pakhe, the speech of acountry bumpkin making gaffes in the use 
of forms he has not mastered. 


It should be noted that the reductions of person-number 
distinctions in the verb endings in this variety of Nepali seem to result 
from more than one factor (e. g., -yaw becoming -yo appears to 
depend on tempo), but are all in the direction of the Tibeto-Burman 
system exemplified by Newari. 


Although Indo-Aryan influence on Newari lies not so strongly 
in the area discussed here, there is reported to be a form of Newari 
spoken in the town of Patan, near Kathmandu, in which the — 


intentional category has lost its evidential function and become fixed 
as a first-person ending. 
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4.3. The Ergative and Subject-Verb Concord 


Newari, like related Tibeto-Burman languages, attaches an 
agentive suffix to the “‘subject’’ of a transitive verb in any tense- 
aspect, but not to the subject of an intransitive. This does not, 
however, affect the verb endings. Being evidential rather than con- 
cord or agreement suffixes, the latter continue to interact in the 
Same way with the person of the subject regardless of whether the verb 
is transitive or an intransitive intentional action. Thus, the agentive 
suffix -n’ does not affect the verb endings: ji-n’-ya-a ‘I did (it)’ 
and ji won-a ‘I went’ both have -a; wo-n' yat-o ‘he did (it)’ and 
wo won-o ‘he went’ both have -o. 


In the more westerly of the North Indian languages and 
dialects, the tendency is to use the ergative (from the Sanskrit 
passive in the perfective tenses of transitive verbs, marking the 
‘‘subject”’ usually with an agentive suffix; other verb forms are not 
ergative. Furthermore, verb endings are concord suffixes and agree 
in person and number, when there is agreement, with the ‘‘object*’ 
of transitive verbs in the perfective and with the subject of all other 


verb forms. 


Nepali, a North Indian language, lies somewhere between 
these two systems. The agentive is regularly suffixed to the subject 
of transitives in perfective forms of the verb. However, usage with 
other verb forms of the transitives is variable, and the agentive suffix 
may be added to the subject in such cases too, depending on the 
particular nonperfective in which the transitive verb occurs and on 
such things as emphasis, speaker, style, dialect, etc. Furthermore, 
the verb endings, which are concord suffixes ratber than evidentials 
at least in more formal style, agree in person (and number) with 
the subject of transitive verbs regardless of whether the agentive 
suffix appears, and of course with the subject of intransitives. Thus, 
Nepali verb endings follow a North Indian pattern in terms of person 
(and number) concord but show a Tibeto-Burman trait when 
remaining unaffected by ergativity. For example, may-le gar-é ‘I 
did (it)? and ma ga-é ‘I went’ both have -€; us-le gar-yo ‘he did (it)’ 


and u ga-yo ‘he went’ both have -yo. 


5. Conclusion 


As said at the beginning, Nepali and Newari have at this 
point been taken only as examples of the convergences discussed. 
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It is not necessarily clear for all of the many parallels between 
them, which are due specifically to the several centuries of contact 
between the two languages in the Kathmandu Valley and which are 
prior diffusions of broader areal features of this part of South Asia, 
including other ancient and more recent Indo-Aryan and Tibeto- 


Burman contacts. 


Only major points in broad outline have been touched on in 
this comparison. There are other, more complex verb constructions 
and, of course, finer details and subtler points in tense usage not 
covered here. The area of comparison was one in which parailels 
are not as striking and lie more in a semantic rapprochement where 
convergence in form lags behind. Thus, a form in one language may 
add a function of aform in another without becoming identical to 
the latter in all senses, as, for example, a topic not covered, the 
influence between two Nepali verbs ‘to be’. and three Newari ones. 
A general syntactic comparison would merely have repeated basic 
patterns of the broader South Asian area. In the case of expanded 
verb morphology, many similarities would stand out: for instance, 
to express the continuous as in ‘is doing’, Nepali has gar-i-rah-e-ko- 
cha and Newari yan-a-cwon. v'-gu du; both are perfects ina sort of 
inchoative or inceptive of ongoing action and consist of gar-/yan- 
‘do’, -i/-a absolutive participle suffix, rah-/cwon- ‘remain’,-e/-o >.y/ 
perfective, -ko/-gu adjectivalizing suffix, cha/du existential-locative 
verb ‘be’. Such comparisons would have been more salient and 
certainly more convincing, but perhaps also more obvious. 


NOTES 


1. This paper is based on research carried out in Nepal in 
1965-1966, supported by Public Health Service research grant 
No. MH 05743. Works consulted for Nepali include: Clark 1963 
Meerendonk 1964, Rogers 1950, Southworth 1967. 


2. The following transcription System are used. Only a few 
phonological observations are made. 


Nepan P YT. ck, bd Dig, sh, mn, rl,wy; ieaauo 
I D are retroflex. Aspiration / breathiness {s marked with h (ph bh, 
etc.). Nasalization is marked with a tilde (€, 0, etc.) a ae 
broad schwa range. a is low central to front. ae ; 
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Wewari: ptck, bdjeshimn, ri, wy; te@ou. h marks 
aspiration / breathiness. A tilde marks nasalization. vy marks a long 
vowel (iv, av, etc.). o covers a broad schwa range. a is low 
central to front. wo includes [o] as a variant. Arguments can be 
made for treating e as yo. oe / o-e includes [¢:] as avariant. ae/a-e 
includes [x:] as a variant. (The hyphen represents one of the 
morpheme boundaries.) n’ represents lengthening and nasalization of 
the preceding vowel (e.g., an’/a-n' [&:]; the acute accent marks 
special morphophonemic symbols). v’ represents morphophonemic 
consonant loss, realized as lengthening of the preceding vowel, with 
nasalization if the deleted consonant is a nasal. (Thus, e. g., an'/a-n' 
and any’ | any’ are both realized as [4:]. 


Nepali @ and Newari a are phonetically equivalent, as are 
Nepali a and Newari o, and both of the latter inclde a centralized 
lower back rounded variant. Nepali o and Newari wo are approxi- 
mately equivalent. However, for many Kathmandu speakers of 
Nepali, if not for others as well, a five-vowel system, as in Newari, 
would be appropriate, with [o] represented as w plus schwa rather 


than as a separate o. 
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CONVERGENCE IN DIGLOSSIA: 
THE SINHALESE SITUATION 


M. W. Sugathapala De Siiva 


(1) In his monumental work on Buddhist Hybrid Sanskrit 
Franklin Edgerton (1953) describes a type of convergence 
(although he does not use the term convergence) that has taken place 
in a contact situaion involving a High language (Sanskrit) and one 
or two vernaculars (Middle Indic). As Edgerton has demonstrated, 
the hybridisms created by this convergence are in a cline towards the 
High vatiety as is the case with every creole situation which sets a 
model language as the highest target: the modification of Middle 
Indic in Edgerton’s analysis are purely devices to accommodate real 
(and pseudo-) Sanskritisms. Of Buddhist Hybrid Sanskrit, Edgerton 
says, 


‘The most striking peculiarity of this language is that from the 
very beginning of its tradition as we know it (that is, according 
to the mss. we have), and increasingly as time went on, it was 
modified in the direction of standard Sanskrit, while still retaining 
evidences of its Middle Indic orgin. In all its texts, even the 
oldest... Sanskritisms are constantly presented cheek by jowl 
with Middle Indic forms, and often with hybrids which are 
neither one nor the other. These Sanskritisms are much too 
common to be comparable with stray Sanskrit loanwords or 
loan-forms which may have been occasionally adopted in many 
a genuine Middle Indic vernacular.’ (p. 4) 


Buddhist Hybrid Sanskrit is only one example of the product 
of linguistic contact which involves a high literary variety on the one 
hand and one or more vernacular varieties on the other. 


| This type of hybridism, arising out of the disparity between 
high and low varieties, characterises the. various Stages of evolution 
in the whole Indic family (and, perhaps, in all South Asian 
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languages). It is pertinent, therefore, to make a thumbnail sketch 
of the diglossic character of Indic languages. 


The Vedic and Sanskrit languages of the Old Indic period 
which are sometimes referred to as the same and sometimes as 
different languages present a diglossia of a type similar to, though 
within the limits of the available data, not of the same magnitude as, 
Say, that in Sinhalese! The migration of the Aryan people into the 
sub-continent from the North Western frontier, followed by an 
eastward spread, resulting in the establishment of three verancular 
regions as Western, Central and Eastern, has been discussed in detail 
in such works as S. K. Chatterji (1960.) Chatterji discusses the reasons 
for and the nature of the linguistic snobbery of the Western elite and 
the resultant puristic endeavour which gave birth to Classical 
Sanskrit. From this point of view, Classical Sanskrit was the result 
of the measures taken by the Western elite to stem the tide of 
linguistic ‘corruption’ setting in elsewhere. Although it is not 
certain whether Classical Sanskrit was ever spoken by the people at 
large or not, there is some evidence to postulate that it was the result 
of a deliberate modification of a language which was in actual use. 
That, in consonance with the fact that artificial restorations result in 
hybridism, the Sanskrit language which embodies the Paninean 
tradition came to be different from the older language which was the 
target as well as from the language(s) actualy in use, is supported 
by the non-Indo-European characteristics which are found in Sanskrit, 
especially in its lexis. If Sanskrit is an organized high (literary) 
language and Vedic the corresponding popular spoken language, then 
the differences between Sanskrit and Vedic must be viewed as signals 
of a diglossia, The absence in Sanskrit of the []] and [lh] allaphones 
of the phonemes /d/ and /dh/, the abandonment of the phoneme 
combinations /iy/ and /vu/, the creation of grammatical sandhis in 
lieu of wholly euphonic ones are some of the characteristics in which 
Sanskrit differs Vedic: these are discussed in detail in Burrow (1965): 


and Pischel (1965). In Burrow’s words, 


‘From the beginning, from the time of the composition of 
the Vedic hymns and the establishment thereby of a recognized 
literary language, there was a strong tendency among the 
Brahmans, the guardians of this literature and of the religious 
and social systems that went with it, to preserve the language 
against change. This applied not only to the preservation of the 
sacred texts themselves, which have been handed down with 
scrupulous accuracy by oral tradition, or to the composition of 
literary works on ancient models, but also to the language of 
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everyday speech among the Brahmans, and in the royal courts 
with which they were always associated. This led to a growing 
divergence between the language of the educated classes and that 
of the people, which was subject to a fairly rapid alteration of 
Middle Indo-Aryan from an early period.’ (p. 36) 


The great grammarian Patanjali’s remark with reference to popular 
dialects that ‘for each correct form in Sanskrit there are many 
incorrect or corrupt formations’ (Mahabhasya.¥, p.5) is further 
testimony to this puristic tendency. The Old Indic situation 
establishes that diglossia, once initiated, tends to continue, each 
variety, of course, changing with time, the relevant varieties, nonethe- 
less, always remaining distinct; it also establishes that, in diglossia 
situations of that order, the distinct varieties continually converge to 
produce hybrid varieties at every stage in the history of the family. 

The treatises known as the Brahmanas (of commentarial 
nature) and the Sutras (of ritualistic nature) provide further examples 
to illustrate the manner in which hybridism results in High-Low 
contact situations. On these works Burrow remarks, 


The language of the Brahmanas ‘still retains pre-classical features, 
such as the use of the old subjunctive, but already the majority 
of the old Vedic forms have fallen into disuse By the time 
of the composition of the Sidtras the language has reached in 
all essentials the stage at which it was codified by Panini. In all 
the Satras it is possible to find grammatical forms which do not 
conform strictly to the Paninean rules, But in contrast to the 
earlier literature these forms are not as a rule archaisms. The 
difference is rather that their usage is somewhat more lax and 
careless than that allowed by the strict formulation of the 
grammarians’. (p. 44) 


The linguistic disparity evinced by the difference between the 
Classical Sanskrit language and the language of the Brahmanas and 
the Sitras on the one hand and between them and the popular 
vernaculars on the other illustrates the rise in India at least eight 
centuries before Christ of a form of diglossia within which the literar 
aspects contained the formal Classical Sanskrit used by the elite as 
the Brahmana type or popular Sanskrit used by the priestly classes 
while the spoken aspect was represented by the language(s) of the 
people at large. Measured in terms of Classica] Buask rit and th 
vernaculars, popular Sanskrit of the Brahmanas is a hybrid variet : 
The whole of the Middle Indic period is characterised by a digl ia 
situation comparable to this in that the literary aspect had - te 
variety and one or more hybrid varieties. With the neve 
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linguistic changes that influenced even the most orthodox forms of 
purism, the Middle Indic period inherited as hybrid varieties a rather 
non-Paninean kind of literary Sanskrit as well as forms of Middle 
Indic languages adapted for literary purposes-which are known as 
Literary Prakrits. Correspondingly, the colloquial end of the 
Scale contained the languages spoken in the various geographical areas. 
According to the information provided by native Prakrit grammarians 
and writers on literary theory in India one might even speak of a 
four fold differentiation containing high literary, literary and popular 
literary as literary form which contrast with the popular speech 
(see Pischel 1965: Intro.). Of these,. literary and popular literary 
varieties are both characterised by hybridisms, although, as one 
would expect, popular literary is more markedly so. 


One reason for the emergence of literary Sanskrit with all its 
hybrid character is the resurrection of Sanskrit by the Brahman caste 
who regained power after a period of Ksatriya domination: during 
the period of Ksatriya domination, religious and administrative work 
was conducted in formalized vernaculars: for example, Buddhism 
‘was taught in formalized Magadhi and Jainism in formalized 
Ardhamagadhi; the Buddhist Emperor Asoka of the Maurya dynasty 
_wrote his administrative edicts in Prakrits. Again, as Burrow points 
out, 


‘After the Christian era Sanskrit too began to appear in inscri- 
ptions, at first in competition with Prakrit, and finally in 
exclusive use. The inscription of Rudradaman (A. D. 150) marks 
the victory of Sanskrit in one part of India. Inthe South 
Prakrit remained in use longer and was not finally ousted by 
Sanskrit until the fourth or fifth century A. D. Eventually the 
use of Prakrit was discontinued entirely and from the Gupta 
period to the Mohammedan invasion Sanskrit-admittedly often 
incorrect Sanskrit-remained in exclusive use’. (p. 58) 


It is interesting to note that many of the early Indian writers 
on literary theory as well as many early Indian grammarians dealing 
with the Prakrit languages recognize three, and sometimes four, 
varieties of language used for literary purposes. Namisadhu com- 
menting on Rudrata’s Kavyalamkara a treatise on literature, 
recognizes three varieties as Sanskrit, Prakrit and Apabhraméa. 
Dandin in his Kavyadarsa, another work on literary theory, mentions 
four varieties, namely, Sanskrit, Prakrit, Apabhramga and mixed. 
The actual implication of the term Apabhraméfa is difficult to ascertain. 
It is significant that Prakrit grammarians like Vararuci do not 
mention Apabhraméa at all among the (literary) languages they des- 
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cribed, Most writers on literary theory make provision for the use 
of Apabhraméa as a medium of literary composition. Did the 
grammarians ignore Apabhraméa because it was so hybrid and did 
yield to and systematisation? Pischel regards Apabhramsas as the 
popular languages insofar as they were not used mainly for literary 
purposes. Relying upon the works of Prakrit grammarians and 
Indian rhetoricians and the interpretation of their works by such 
scholars as Pischel one may safely conjecture that there were at least 
two varieties of Apabhraméa, namely, literary ApabhramSsa and the 
various popular languages also known by the same name because of 
their extreme divergence from Sanskrit; ApabhraméSa literally means 
‘corrupt’. In this way it is feasible to talk of a four-fold distinction 
as high literary (Sanskrit), literary (Prakrits), popular literary 
(ApabhramSa) and colloquial (Apabhramsa). The literary and 
popular literary languages show various stages of hybridism, measur- 
able in terms of high literary and colloquial (of a given time). 


Pischel remarks that. 


‘the Prakrit languages are artificial dialects insofar as they are 
considerably altered from literary viewpoints by poets. But 
they are not artificial dialects, if thereby is meant that they are 
the result of the efforts of the poets. What is true with these 
dialects is equally true with Sanskrit, which has never been the 
commonly spoken language of the cultured Indians. But 
Sanskrit was not the basis of the Prakrit dialects, which indeed 
go back to acertain popular spoken dialect, which on political 
or religious ground, was raised to the status of a literary 
medium’. (p. 4) 


The mention of terms like Sauraseni Prakrit along with terms 
like Sauraseni Apabhraméa seems to suggest that there was for every 
Prakrit a corresponding literary Apabhraméa and a Colioquial 
ApabhraméSa. 


The Middle Indic charateristic discussed above, it must be 
emphasised, is a typical characteristic of diglossia situations which 
have had a long history: with the inevitable changes that take place 
in the language simultaneously with the puristic efforts to retain a 
distinction between spoken and literary usages, many hybrid varieties 
of literary standard naturally emerge, some maintaining close affinity 
with the older literary form and used by a very small circle of writers 
and others originating from hybridisms produced by the efforts on 
the one hand te sifting a standard for literature from the spoken 
language and on the other hand to following, though not quite 
successfully, the old literary model asa target. Corresponding to 
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the high literary, literary and popular literary varieties used in 
Writing it is feasible to divide literary purposes too as high literary, 
literary and popular literary purposes. As mentioned before, the 
relationship of intermediate (hybrid) literary varieties to the high 
literary variety and the spoken variey is comparable to that obtained 
between creoles and the contributing languages. The general nature 
of the multiplication of literary varieties in the evolution of languages 
with a long standing diglossia is shown in diagram 1. In my diagram 
the symbol +--+ indicates that the varieties are decidedly of the 
Same language; +----+ indicates that although a relatioship is 
recognized its identification is not as clear as in the previous case: 
// indicates the introduction of a kind of bilingualism into the situation, 
resulting from the retention of an archaic language, unintelligible to 
the people, as a high literary variety; thick, broken and dotted lines 
indicate degrees of historical relationship in the evolution of the 
linguistic situation stated. This diagram is of no significance to 
historical linguistics for it is far too general and is intended only asa 
scheme for observing the varieties-split in diglossia. The number of 
literary varieties operating during a given stage is conditioned by 
factors other than those that can be tabulated in a diagram of this 
sort. The diagram shows the maximum potential within the same 
language: the systematic varieties realised are as many or less accord- 
ing to requirements, but never more. The varieties whose evolution 
is indicated by dotted or broken lines only are hybrid varieties, not 
systematic by definition, and caused by the convergence of two 
varieties. 


This notion is applied to the Indic situation in diagram 2. 
As Stage 1, I have taken Old Indic, as Stage 2 Middle Indic and as 
Stage 3 Sinhalese. In introducing Sinhalese into the diagram, I am 
anticipating what I shall have to say later about the Sinhalese 
situation. It is admitted that all modern Indic languages do not 
behave like Sinhalese. It will be noticed that the literary varieties in 
Sinhalese do not show the maximum varieties potential. As mentioned 
before there can always be a difference between the maximum varieties 
potential and the socially necessitated varieties selection. The social 
reasons for this limited selection in Sinhalese will be explained in due 
course. (a) and (b) in Stage 3 are two distinct periods in the history 
which will, again, be explained later on. 


By definition diglossia constitutes two varieties the differences 
between which are systematically statable at the phonological, 
grammatical, lexical and situational levels. These varieties, like all 
varieties, have mutually exclusive social uses. It is axiomatic that 
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in diglossia situations hybridism inevitably takes place resulting in a 
fluctuating variety or varieties which do not form part of the diglossia 
Owing to the unsystematic nature involved. Diglossia implies varieties 
the mutually exclusiveness of which can be Systematically stated; 
where the boundaries are not clear enough to facilitate systematic 
statements, the term ‘hybridism’ is suggested in order to stress the 
fluctuating character involved. 

The foregoing introduction would suffice to illustrate three 
major characteristics of diglossia. Firstly, the maintenance of diglossia 
(at least in the languages under examination) is a puristic endeavour. 
Secondly, despite purism, the divergent varieties, while maintaining 
their individual character, tend to converge and generate a multiplicity 
of hybrid forms under certain circumstances (e. g. popularisation of 
literary pursuits), Thirdly, these hybrid varieties have a tendency 
to fluctute between extremes and behave in the way varieties of creole 
do in the context of model languages. Buddhist Hybrid Sanskrit and 
the various popular literary Prakrits exemplify these characteristics. 
The Sinhalese situation, as will be seen from the rest of this paper, is 
no different from these older situations in this respect. The purpose 
of this introduction is, indeed, to illustrate the generalisability of the 
nature of convergence irrespective of the chronological distance 
involved in the defining situations. I shall attempt to show the 
effects of purism on the creation and sustenance of Sinhalese 
diglossia and the effects the popularisation of literature has had on 
the emergence of what I propose to call ‘Hybrid Sinhalese’. 


(2) A short history of Sinhalese diglossia: In languages with distinct 
high and low varieties, the high variety is as a rule sustained by a 
minority. Had it not been for the puritanical elite the literary variety 
of Sinhalese would have vanished leaving only its traces in the hybrid 
forms of the language. Minorities usually endeavour to preserve the 
institutions which symbolise them. This phenomenon has been 
referred to as nativism, for, says Kroeber ‘after two societies have 
come into sufficiently close contact for one to feel the other as 
definitely more populous, stronger, or better equipped, so that its 
own culture is in a process of being supplanted by the other, a 
conscious preservation effort or defense is often produced. Such 
reactions have been called nativistic endeavours or revivals: (see 
Kroeber 1968:437) I shall in the following paragraphs attempt to 
examine the nativistic tendencies which contributed to the sustenance 
of the present diglossia in Sinhalese. 

Sinhalese has lithic records dating from as early as the third 
century B.C.2 The earliest extant literary work, however, belongs 
to the ninth century A. D.* even though there are references to 
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earlier works.1° Although it has been suggested that the language 
of the inscriptions was closer to the spoken language than to the 
literary, there is no evidence to support such a thesis. The inscriptions 
of the ninth and later centuries have been written in a grammar, 
identical to the grammar of the literary works belonging to the 
respective periods. If this similarity is significant it may be deduced 
that the grammar of the inscriptional language was hever different 
from that of the language of literature. The inscriptions and the 
available literary works from the third century B, C. to the fifteenth 
century A. D. show that the literary language has gone through 
several stages of evolution: this is to be expected under normal 
circumstances. The grammar of the literary language of the present 
time, however, is in no way different from the literary grammar of 
the thirteenth and fourteenth centuries. This ‘stand still’ or 
‘retardation’ is, as will be seen from the following discussion, a 
product of the puristic tendency inherent in nativism. 


Until recently Sinhalese literature has been characterised by 
a clear-cut division of prose and poetry, producing now an era of 
prose literature and now an era of poetry ina mutually exclusive 
manner. In the fifteenth century one such period of prose literature 
ended and the literatteurs took up poetry for literary composition. 
Not long afterwards European invasions started with the coming of 
the Portuguese in 1505 A.D. Due to the internal turmoil that 
resulted from these political events, all forms of creative art ceased 
to function and no worthwhile literary works were produced for at 
least three centuries. The only writings of the sixteenth and seven- 
teenth centuries are a set of war ballads which, although historically 
important, are of no literary merit. With the cessation of prose 
writing in the fifteenth century, and of all forms of literature since 
the invasions, the elite lost their hold on the language until the 
beginning of the first revivalist movement of Valivita Pindapatika 
Asaranasarana Saranankara Sangharaja (1698-1778). Ishall refer 
to this movement as the first stage of nativism and purism. 


The administrative documents written during the period of 
literary inactivity contain a grammar which is very close to the 
grammar of the present spoken language but very different from 
the grammar of the pre-fifteenth century literature. The following 
sentences from an Agreement signed by a Britisd Governor and the 
members of the Sinhalese court, may in this connection, be com ared 
with its modern spoken equivalent that follows. : 


Sinhala vasiyo novana meratavalvala hitina nohot ehi 


d . il ili . Py . . enda 
yedena sivil milisiya sama ayagema sivila saha kriminel 


nadu 
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asima angalante mahardjottamayanan vahansege ajnaven heva 
pannakkarayanne kramayata yatat ya. hevapannakkara mul- 
adani saha heva pannakkarayoda heva samudava katuva 
enavunda ara anikut simadenadma minimarum karanaval ara vena 
sama karanavalvala dima ingirisi govarnaméntuve ejant yana 
nilamakkaratan nohot nilamakkarayinge naduvaia yatat ya.6 


Sinhala vesiyo novana meen ratovalvola inns nettan eevata endo 
yedens sivil milisiye hema ayagemo sivil naduy kiriminel naduy 
ehiima engolante maharaajoottomayaananvahansege aajiiaaven 
heevaa pannakkaarayanne korometa yatat. heevaapannakkaara 
mulaadeeniy heevaa pannakkaroyoy heevaa hamudaavat ekke 
enovunuy era anikut hemodenaamos miniimerun kaaronaaval zero 
anit hema kaaranaavalvoladima ingirisi aanduve eejant kiyono 
mulaadeeniyage naduvato yatat.’ 


The original is grammatically different from my rendering of 
it into the spoken language only in the use of ya at the end of each 
Sentence and the functors nohot, saha, -da (e. g. enavunda) yana and 
-hi (e.g. ehi). The Sangharaja’s own writings too were grammati- 
cally distinct from the language of earlier literature in a similar 
manner. if the grammatical tradition of the administrative documents © 
and the writings of the Sangharaja were accepted by the subsequent 
scholars the present disparity between the colloquial and literary 
languages would never have arisen in such magnitude. But that 
grammar was incompatible with the nativistic endeavours. 


The Sangharaja’s revivalist movement produced a considerable 
number of scholars with a good command of the classical works.8 
This pupillary succession is considered so significant in the country 
today that the study of it constitutes a good portion of the Sinhalese 
literature syllabus in the Ceylon Universities. These scholars were 
obsessed with the usual revivalist temperament which accepts 
axiomatically that revivalism, to be meaningful, must aim at re- 
introducing the characteristics of the Augustan age. It is customarily 
acclaimed even at the present time that the Augustan age of Sinhalese 
literature was in the thirteenth and fourteenth centuries: Amavatura, 
Busarana and Saddharmaratnavaliya constitute the Augustan model.9 
Even though the Safigharaja’s own writings were characterised by 
non-classical grammatical features his pupils thought it more 
expedient to copy the grammar (and style) of such modelsif a true 
literary revival was to be brought about. 

Consequently, the Augustan grammar had to be taught 


through prescriptive, pedagogical handbooks. When it became 
necessary to prepare hand books of classical grammar, scholars took 
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as their guide the thirteenth century compendium called the Sidat 
Saiigarava, notwithstanding the fact that being a Pourastive” state- 
ment of the language of prose and poetry, the Sidat Sarigarava did 
not provide all the grammatical rules observed in prose writings. (see 
DeSilva 1965*). The nominal and verbal inflections were, nonetheless, 
adequately discussed, and with that as the basis and using classical 
literary data a large number of handbooks were written. K.N.O. 
Dharmadasa has collected some fifty-eight such titles, a good number 
of which have been written very recently (Dharmadasa 1967*) The 
‘Sinhalese language’ periods in school time tables were, and still are, 
devoted solely to the teaching of the literary grammar. 


The Sangharaja Renaissance, the aim of which was to reinstate 
the Sinhalese culture, Buddhism and the Sinhalese language in their 
pristine purity resulted in the establishing of a number of seats of 
learning which gained international reputation for Classical Indic 
studies (Sarathchandra 1950*:40-3). Owing to the nature of the 


government at the time the study of oriental languages was not 
vocationally very rewarding: vernacular teaching which was the best 
occupation available after such studies was rated far below English 
language teaching as well as all forms of teaching which were done 
in the English medium. Indic languages, mainly Sinhalese, Pali and 
Sanskrit, were studied in monasteries and vernacular schools, and 
were read by Buddhist monks, Ayurvedic physicians, and vernacular 
teachers. These three classes of people still constitute the core of 
nationalism in Ceylon.1° The nature of the campaign conducted by 
these people in their feeling of subservience owing to foreign political 
and cultural domination and their desire to preserve the true heritage 
of the nation is evinced by their propaganda literature and the 
various mewspapers they established. Owing to the nationalist 
sentiment that began to prevail all writers adhered to the literary 
grammar lest their endeavours would be misconstrued as unpatrotic, 
The Sinhalese reading public being comparatively small and the 
preservation tendency being the guiding factor, ample scope was 
provided for a high language to come into being. The choice of the 
thirteenth and fourteenth century form of language was due to the 


reputation of that period as the zenith of cultural 


activity in the 
country. 


This first stage of nativism was follo 
during which the language of the novel took 
popularisation of literature asa result of t 
Subsequent to the period of literary and 
(second half of the nineteenth century), 


wed by a more lax period 
a different shape. With 
he advent of the novel 

religious controversies 
Which was an important 
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land-mark in the development of the language of Sinhalese literature, 
some convergence was beginning to appear. 


E. R. Sarathchandra refers to this period as 


‘a period of controversy which lasted for about half a century, 
the most important outcome of which was it rendered the 
language a more plastic instrument for the use of the writers of 
prose fiction, who appeared at the beginning of the twentieth 
century.’ (Sarathchandra 1950*: 41-2) 

He observes that 


‘between the time of the last works of the classical period and 
the controversies, the language had undergone many changes, 
both grammatical and otherwise and some of the older 
controversies provide us with examples of the earliest attempts 
to write in the unsettled idiom of the day.’ (Sarathchandra 
1950*: 45) 


This period contributed to the breaking down of the rigid 
difference between the spoken and literary usages, and had it not been 
for the second stage of nativism which will be discussed below the 
disparity would have continued at a very low scale. The most 
important contribution of this polemical period was the committing 
of the spoken language into writing: except fora very small number 
of grammatical features, such as the sentence-final-ya and certain 
special particles, which must have been included by the reporters 
under the influence of the existing literary language, the special 
literary features observed during the period have been completely 
ignored. Compare this excerpt from the Savsatdam Vadaya with 
its literary rendering that follows: (The literary rendering is mine.) 


bohd pandita terindek dhammatilaka unnansege kima hatiyata 
saradam kia vat yatalaba safigarakaruge kima hatiyata saram 


kiya vat yedennéma nata kiya maha garu gambhirayakin karatota 
nayaka sthavirapadayan vahanse sdduvava kaviyaka_ kiya 
kaviyakut genahara dakva degollagema kimata viruddhavoa sarada 
mati vena bava kiya tibenavaya. samaharavita € kima visvasa 
karagena ema kaviya 4m ahgé liyamané varadava prasiddhakara 
tamagéma praydjanayata haravagena tibenavaya. numut ema 
kaviya yatalaba sangarave accugasa tibena vidhiya sibaé nam 
saradam karyo varadivak men ma saram karayot varadi ya. 11 
boho ugat teri#daku dhammatilaka unnanségé kima prakara 
saradam kiya h6 yatalaba savigarakarugé kima prakara saram 
kiya ho no yedémavayi mahat garvayekin karatota nayaka 
sthavirapadayan vahansé visin sadana lada kaviyakayi kaviyak da 
genahira dakva depaksayéma kimata viruddhava sarada mati vana 
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bava kiya tibé. samaharavita é kima piligena ema kaviya aoe 
liyamanehi varadava prasiddha karagena tamagema prayOjanayata 
haravagena tibeé. ehetema kaviya yatalaba sa7igarave mudranaya 
kota tibena akaraya sibo nam saradam mataya gattan menma 
saram mataya gatto da varadi ya. 


Prior to the period of controversies the story writers. used a 
uniform grammatical pattern for both narratives and dialogues. 
The use of a uniform pattern in this manner was the classical norm 
in the treasured Augustan works, which, incidentally, were all story 
books. With the recording and printing of the controversies, 
however, the writers realised for the first time the forcefulness of 
the spoken idiom in the communication of the themes presented in 
fiction. The introduction of the modern novel asa medium of 
literary expression coincided with this discovery. With the intro- 
duction of the novel, the literature took a further turn in that it 
began to be focussed upon a wider reading public. The concept of 
mass education towards a political independence was an added 
impetus in this direction: the masses had to be taught the nature 
of Christianity (as against the virtues of Buddhism), the evils of 
colonialism and the excellence of Sinhalese morality. The early 
novels were written with these didactic motives. (see Sarathchandra 
1950*: eal) 


The distinctive characteristics of the literary language, which 
were maintained and rigidly upheld in every sentence throughout the 
period of scholastic literature was gradually abandoned in the 
dialogues and conversations in these novels. The narrative continued 
to be written in the classical grammar and the dialogues in the 
grammar of the spoken language. The fact that the high literary 


forms of grammar ceased to be the only pattern usable on paper 
was a Singnificant change. 


It must be emphasized that this change was a gradual one. 
The early novelists like Piyadasa Siriséaa (1875-1946) (Sarathchandra 
1950*: 92-112) generally followed the classical tradition of not 
distinguishing between narratives and dialogues, but one finds in their 
works an occasional dialogue written in the grammar of the spoken 
language. The following excerpt from Siriséna’s Jayatissa saha 
Rosalin is a part of a dialogue written in the classical grammar: 


prémiya, numba atissayin prabédha vivak 
es€ vimata karanaé kavaréda? kumudu tarak 
namut candralokayata visesayen pr 
pamineti. 


men hafigenniya. 
a eliyatat pipeti. © 
abddha vi SantOSayé simavata 
emen oba namiti candraya dakimen magé sita namati 
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kaumadaya min prathamakisi dinaka nopaminiyava pritiyata 
paminenniya. 


Compare this, however, with the following excerpt from the same 
author’s 


Sucaritadarsaya: 


katayutu hungak durata gonuvuna da? magé kantoruva 
balakaraganimin vun kol!a iyé kélamak kiya asuvi mama elava 


damma. mata kollek natuva ada kantéruva atu give vat naha. 


With the expansion of the novel as a form of art without being 
limited to didactic function, the later novelists developed the 
grammatical distinction between the narrative and the dialogue which 
resulted in the emergence of the general pattern obtaining now: all 
sentences written within quotation marks are given in the grammar of 
the spoken language while the other sentences are given in the literary 
grammar. In spite of the polemical literature no attempt was made 
by the writers to alter the grammar of the narrative sections of their 
works: this, the persisting nativism did not allow. That the grammar 
of the colloquial language was accepted for printing, even though in 
a limited way, must have, however, contributed in some way to the 
emergence of hybridisms in literary usage. 


The introduction of the novel and the short story increased 
the reading public and the number of writers tremendously. Some 
young writers who did not have a profound mastery of the classical 
grammar began to write in what I have referred to as hybrid varieties, 
so that the domain of literary grammar became lax. This process 
was at least partially enhanced by the popular press. The newspapers, . 
while maintaining a grammar akin to the classical, have violated some 
of the morphological and syntactic rules laid down in the prescriptive 
graramars. The accusative singular suffix -aku of the older times has 
been changed to -eku. The -navaa form of the verb, which is strictly 
a spoken one, has come to be written in interrogative expressions like 
yanava da ‘(are you) going?’ An old rule requiring the -ek ending 
of the indefinite nominative form of inanimate nouns has been 
changed, and the more common -ak came into vogue: thus potek 
‘a book’ has come to be written as potak in conformity with the 
spoken form. Although these and a few other changes that have 
come to be accepted through the novel and the press did change the 
actual literary language somewhat, the classical nature of the grammar 
did not disappear altogether. 


The second stage of nativism which arose as a counteraction to 
this tendency towards hybridism has been instrumental in the retention 
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of the old grammar up to the present day inspite of the innovations 
of the novelists and the journalists. During this stage nativism 
strengthened its stronghold on schools and reinforced the teaching of 
classical grammar in the classroom. The movement was headed by a 
popular teacher, referred to by his adherents as gurudevi ‘god among 
teachers’, namely, Kumaratunga Munidasa. Munidasa was a man of 
great learning in Sinhalese, Pali and Sanskrit. During the earlier 
part of his life he wrote in a language similar to that of the Butsarana, 
but later become a proponent of the ‘pure Sinhalese’ movement 
which still campaigns for the use ofa pure variety of language 
which is devoid of Sanskritisms. It must be emphasized that the 
non-Sanskritic, pure language they advocate even antedates the 
period of the Butsarana. Munidasa’s movement, known as the 
Hela Hayvula ‘Pure Sinhalese Group’, has a strong following in the 
country, and one of the functions of this group is to demolish writers 
who violate the classical rules of grammar in their narratives: the 
language of dialogues, however, is generally not criticised. Munidasa’s 
instructton to his group is embedded in the following translated 
quotation from his Vyakarana Vivaranoya (Munidasa 2492 B. E.*): 


‘Nowadays some people seem to think that grammar is irrelevant. 
To him who suffers from indigestion, food is indeed a nuisances 
From the primitive hunter’s point of view, clothes are only 
things to laugh at. When one looks at things this way, it isno 
wonder that there are men who hate grammar. In civilized 
Society, hower, language needs grammar. If there is 
permission to violate the law, it will be to the mirth of the 
criminal. If, for the happiness and comfort of the criminal 
social laws were allowed to be violated, civilisation would besin 
to disappear straightaway. If there were permission to violate 
language rules, the ignorant ones would certainly be happy....... 
It would provide a way to conceal their ignorance..... In this 
country, alas, ignorance is erudition; knowledge is a thing to 


ridicule. An attempt must urgently be made to remedy 


: : this 
Situation,’ 


Notice the puristic definition of the term < ; 
‘grammatical rules’, With such campaigns in defence Cr ch oe 
grammar based on the classics, the move towards bridging age V 
between the spoken and literary languages was s'’owed down, and gap 
effort made in that direction was reve:s:d The Hela ee ; me 
holds annual convocations all over the country to ioe ; = 
anniversary of Munidasa’s death on 2nd March, but more si ae — 
to demolish non-conformisms that might have co ae 
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writings of the year. It is interesting to see how certain members of 
the academic circles who denounce this puristic endeavour as 
irrelevant to the development of the literature, still write their 
objections to it in the literary grammar lest they should be branded 
as ignorant of the rules. Munidasa and the Hela Havula differ from 
the other purists only in the use of derived words as Opposed to the 
Sanskritic lexicon. All groups advocate the same grammatical rules. 
The differences between the writings of the adherents of the Hela 
Havula and those of the other puristic groups are therefore a matter 
of phonology and grophology. The advocates of Sanskritc loans are 
numerically stronger but the Hela Havula movement is a more conso. 
lidated effort to preserve the classical norm. Between these two types, 
the revolutionary inevitably gets demolished. A certain amount of 
opposition to the mainténance of the diglossia hes been published by 
some three scholars and they seem to have gained a small following.12 
Inspite of this innovation, the allegiance to the classical norm stiil 
prevails Such organizations as the National Language Department 
and the Academy of Letters which are sponsored by the Government 
of Ceylon contribute to the maintaining of the rigid rules of the 
literary language. 


(3) Effects of redundancy on hybrid formation: Section 1 
above outlines the manner in whicha number of hybrid varieties 
arose in the Indic family for literary purposes asa direct result of 
the convergence of high (Sanskrit) and low (vernacular) varieties. 
Section 2 which deals with the history of the Sinhalese situation 
mentions in brief a number of hybrid literary varieties, to wit, the 
writings of the Sangharaja, the colonial administrative documents, 
the language of the novel subsequent to the polemical period 
referred to by Saratchandra as ‘the unsettled idiom of the day’, and 
‘the corruptions’ or violation of the rules by young writers which 
are under constant attack by the purists. The pattern of hybrid 
formation is similar in both instances. In my view, the primary 
cause of this kind of hybridism is the factor of redundancy which 
characterises the high variety in comparison with the low. Before 
discussing the modern Sinbalese situation at length, therefore, i 
propose to discuss the manner in which redundancy operates in 
hybrid formation. (See also De Silva 1970.) Convergence, which 
creates all forms of creoles and hybridisms, is the result of what 
J. Lyons (1968 : 90) refers to as the warfare between two opposing 
tendencies in languages, one to achieve efficiency and the other to 


achieve understandability. 
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It is widely known that High Literary Sinhalese differs from 
Colloquial Sinhalese in both phonology and grammar, although 
these differences may be handled as surface characteristics only.1 
The general tendency is not to use the grammar of the colloquial 
variety in serious writings. I shall attempt to show how the special 
characteristics of the literary language may be interpreted as redund- 
ancy phenomena with reference to the colloquial variety of the 
language. 


Viewed in its own rights Literary Sinhalese hasa case for 
being treated as having the categories of gender, number and 
person as syntactically relevant categories. From a structuralist 
standpoint, an independent analysis of the spoken language does not 
allow for stating these categories in that way These are only some 
of the categorical differences between the two varieties. But there 
is an important point to remember which is very relevant to the 
present thesis: although Literary Sinhalese is never spoken except 
when reading, even the most illiterate speakers of Sinhalese are able 
to listen to and understand the language of, say, newspapers, without 
being impeded by the grammatical peculiarities of the type mentioned 
above. In some one thousand tests conducted with a view to 
Observing if illiterate people misunderstood literary texts (from 
newspapers and novels) because of the grammatical peculiarities 
involved, I had no clues whatsoever to prove that these peculiarities 
impeded understanding. Notice that I leave lexis untouched lexical 
differences are many between any two varieties. Any lack of under- 


Standing in my test cases was due to lexical difficulties: once more 
common lexical items were used to replace the more Sanskritic or 
unknown ones, the purport of the texts was understood by every 
person tested. They were able to answer questions based on the 
passages read to them not more than two, or at most three, times. 
It is assumed that this intercomprehensibility factor is brought about 
by the redundant nature of the apparently extra categories in the 
literary variety. In establishing Colloquial and Literary Sinhalese as 
varieties of the same language — with the Support 
prehensibility factor and the acceptance by the 


belong to the same language — one, then, treats the extra markers in 
the literary variety as extra exponents of the same Categories rather 
than as distinctly different categories. Such extra exponential 
criteria obtaining in one variety as opposed to another, may be ° 
interpreted as redundancy factors, As a rule, grammatical Categories 
are realized ina larger number of exponents in high varieties and 


of this intercom- 
society that they 
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in a small number in low varieties, In Sinhalese, gender, number, 
person, etc. are multi-exponential in Literary and uni-exponential in 
Colloquial. (See Gair 1968 for details.) Notice. 


Coll. mama yanava Slee * 
Bit: mama yami pol Sys ie 
Coll. api yanava “We so - 
Lit. api yamu “We go” 
Coll. eya yanava “He goes”’ 
Lit. hetema_ yayi *‘He goes’”’ 
Coll. eya yanava **He goes, she goes’’ 
Lit. hetema = yaanneeya “‘He goes” 
aya yanniiya “She goes’’ 


It is because the multi-exponential realization of categories in 
the literary language involves redundancies which are more like 
embellishments than factors relevant for communication that its 
grammatical shape does not impede the understanding of literary texts. 
In treating varieties which are grammatically different on the surface 
as varieties of the sam; language, then, the factor of redundancy is 
stated as a demarcative characteristic. This, perhaps. introduces a 
new dimension of redundancy. The notion that redundancy operates 
at a given level is an old one and has been applied to the study of 
particular languages, particular varieties, at particular times. The 
factor of redundancy is employed here as a mark of variety 
differentiation. In diglossia situations, the apparent access in number 
of the membership of categories or in the exponential criteria, 
obtaining in one variety as opposed to the other which is more 
restricted on those lines are, then, redundancy criteria from a 
communication point of view. 


In treating Literary Sinhalese as characterised by redundancy 
in exponence I have assumed that the colloquial language is the norm 
or the grid in terms of which the literary language is understood. 
Where varieties of the same language are differentiated by redundancy, 
the concept of a grid variety in terms of which varieties with extra 
redundancies are comprehended grammatically seems to be meaningful 
in extreme diglossia situations like Sinhalese: consider my tests 
reported above. 


It seems to me to be correct to assume that in comprehending 
the literary form in languages like Sinhalese, people ignore the 
‘added’ redundant phenomena in that form of the language. From 
this point of view and in comparison with the colloquial language, 
the extra grammatical markers in the literary variety may be regarded 
as superimposed characteristics: that they are superimposed is 
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corroborated by the history of the situation discussed in Section 2. 
These theoretically superimposed and thereby really redundant 
phenomena in the literary language function as the characteristics 
which distinguish Literary Sinhalese from Colloquial at the level of 
grammar (and phonology). Supported by the historical perspectives 
and measured in terms of the grid, Iam tempted to refer to these 
literary characteristics as ‘created’ redundancies. These characteristics 
add little to the grammatical mechanism which the Sinhalese person 
employs in understanding his language. 


Redundancy factor operating between varieties creates 
hybridisms. Just as two language coming into contact with one 
another can result in the creation of pidgins and, later, creoles, even 
so two distinct forms of diglossia can invariably create a third variety, 
or more exactly, a number of third varieties, which constitute pidgin- 
like or creole-like forms, the properties of which are not systematic 
but have to be accounted for with simultaneous reference to the 
varieties which can be described more systematically and which have 
converged to produce these third varieties. This kind of hybridism 
created by people in their attempts to perform in the high variety 
without having gained a full mastery of it is known in Arabic as well 
as in Sinhalese. It is neither the low variety nor the orthodox high, 
and has a tenden-y to fluctuate so heavily that no systematic state- 
ment can be made of it Hybridism is very much linked with 
redundancy. It is the result of attempts to perform in varieties which 
have redundant phenomena in terms of the grid of the individual or 
the community. The process involved in this convergence is that the 
redundancy phenomena are partially observed and partially not, now 
observed here and now observed there, now coinciding with the 
orthodox high and now violating it. Section 4 will illustrate this 
from actual examples from present-day Sinhalese writings. 


(4) Hybridisms in Literary Sinhalese. (a) Phonology and 
Graphology. As has been already stated, the grammar of the literary 
variety is never used in colloquial speech even on the most formal 
occasions. The grammar of the colloquial variety is, likewise, not 
generally accepted for writing except for use between Paptation 
marks in novels, short stories and reports, in verbatim reports very 
intimate letters, private notes, etc. The few revolutionary writersi4 
who attempt to bridge this gap are subject to constant criticism. 


In this way, the literary and colloquial forms of gfammar perform 


two mutually exclusi ioe 
y exclusive functions (in spite of hybridism; see section 5), ° 


All schools of purism are in agreement with this. The situation 
with regard to phonology, however, is somewhat different. Unlike 


ee a a ee 
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the non-observance of literary grammar, to read a literary text 
Without observing all the phonetic characteristics which strictly 
belong to that variety is not regarded as an umpardonable offence. 
In reading, one may use either the strictly literary pronunciation 
involving aspirate stops, palatal sibilants and consonant clusters, or, 
alternatively, employ a pronunciation closer to colloquial in which 
such phonetic qualities do not obtain. With the availability of such 
a choice, frequent fluctuations between the normative literary 
pronunciation and the colloquial speech habits generally take place. 
This tendency towards hybridism, supported by the fact that it is 
not really condemned, often creeps into the graphic form as well, 
resulting in representations which violate the prescribed literary 
norm. The acceptance (or tolerance) of this variability in the pro- 
nunciation of literary texts and the resultant orthographic variability 
is partly, at least, due to the existence of, and mutual tolerance 
between, two schools of purism, one Sanskritic and tbe other anti- 
Sanskritic. Qualities such as aspiration belong to the Sanskritic 
vocabulary which is taboo in the anti-Sanskritic school. This 
acceptance also has a historical reason which is as follows. 


From very early times the Sinhalese language has been regarded 
as having two ‘alphabets’ known as the Suddha Sinhala Hédiya or 
Elu and the Misra Sinhala Hodiya. The former lacks aspirate 
symbols, conjunct consonant symbols and several other Sanskritic 
characteristics while the latter is a transliteration of the whole 
Sanskrit alphabet. These two names may, therefore, be translated 
as the ‘non-Sanskritic alphabet’ and ‘the Sanskritic alphabet’ 
respectively. The Sanskritic alphabet has some twenty-three basic 
symbols and a few conjunct letters which are not found in the 
other15. The non-Sanskritic alphabet was originally used for poetry 
in which borrowing was not allowed and only the historically derived 
forms were permitted1®. Prose, in its ornate form, borrowed 
lavishly from the Sanskrit lexicon, and such loans were faithfully 
transliterated. With the passage of time, however, the alphabetic 
restrictions which characterised the language of poetry penetrated 
into the prose writings of some authors, who, in their puristic 
endeavour, denounced the use of Sanskrit loans in any form of 
writing. The Hela Havula is one such anti-Senskritic group of 
purists. The Sanskritic purists, on the other hand, encouraged 
borrowing from Sanskrit. Prose writings have always been largely 
in their favour. With certain changes in prosody in the fifteenth 
century, Sanskrit borrowings were tolerated in some poetic works, 
but such loans were transcribed in the non-Sanskritic alphabet: the 
aspirates were transcribed with their corresponding unaspirate 
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symbols, all sibilants with the available single sibilant, and so on.? 7 
In my opinion, it is this clash between, and later, the coexistence 
of, two schools of purism that paved the way for the tolerance of 
the afore-mentioned variability in the literary pronunciation. This 
was naturally enhanced by the non-phonemic and embellishment-like 
redundant nature of the special literary characteristics in question. 
If the non-pronunciation of a written aspirate symbol is corrected, 
it is not so much because it is considered sub-standard, but because 
of the general Indian practice and belief that all written symbols 
must be clearly pronounced in reading. The non-Sanskritic alphabet, 
in spite of its original function as the alphabet of poetry, provides a 
writing system capable of representing the colloquial speech 
adequately18. In terms of the colloquial grid, therefore, the 
Sanskritic symbols in the Sanskritic alphabet are graphically 
redundant. That, unlike grammar, the pronunciation habits in the 
colloquial language had such a model for recording them is also a 
significant factor behind this acceptance. 


Although, as mentioned before, the normative grammar of the 
literary variety is never used in colloquial, even in the formal types of 
speech, the situation with regard to phonology is somewhat different. 
Segmental sounds and sound combinations which constitute the 
writing of the normative literary variety may be used in formal 
colloquial; sermons and orations are sometimes characterised by these 
features. Informal colloquial is not lacking in Sanskritic loanwords, 
but such loans have been absorbed into the colloquial phonological 
system which is devoid of such features as aspirations and palatali- 
zation. Between the normative literary extreme on the one hand and 
informal colloquial on the other, there is, because of this divergence, 
plenty of scope for phonological and graphological hybridisms, and, 
indeed, such hybridisms are observed both in writing and speech. 


i In order to study the phonological and graphological hybrid- 
isms in Sinhalese, it is necessary to divide the vocabulary into two 
groups as 


(a) items which do not have specific Sanskritic characteristic 
in the literary variety, and 


(b) items which are characterized by Sanskritic features (such 
as aspiration) in the literary variety. 


The items in group (a) rarely yield themselves to hybridism. 
An occasional intrusive aspiration or palatalization characteristic 
may be observed in reading, or even more rarely, in writing. e.g 
the pronunciation of vidya ‘knowledge’ as [vidhyaa] or writing it 
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as vidhya as I have attested in one instance. Similarly, abhasaya 
‘influence’, pronounced occasionally as aabhaa fayo and written as 
abhasaya. Similarly, the writing pradhanaya and pronunciation 
[prodhaanoyo] for pradanaya ‘donation’. This situation is com- 
parable to the glottal fricative intrusion in sub-standard English, 
although such a social distinction is not associated with the Sinhalese 
intrusive aspiration and  palatalization. Frequent hybridism is 
observed with reference to items in group (b). In measuring these 
items with the colloquial grid, their redundant Sanskritic features 
are partially observed unsystematically, or completely ignored in 
compliance with this grid. Itis this group that I propose to focus 
attention on. 


These phonological observations are based on the reading (and 
writing) of the literary variety. It will be seen that the hybridisms 
in question are due to redundancies represented in terms of lexical 
Sanskritization and alphabetic anomalies (arising out of the non- 
phonemic character of the Sanskritic alphabet). The normative 
reading style includes the reading of Sanskrit words in their 
supposedly original pronunciation, but styles of reading vary from 
the normative to near-colloquial. These non-normative styles are 
hybrid. Although the features involved in hybridism are not 
Systematic, the maximum number of possibilities of hybrid 
formations can be exhaustively stated as in the following 
paragraphs. 


(a) Aspiration and palatalization in hybridism: Although in 
the normative literary variety segmental slots can be prescribed for 
aspirates and the palatal sibilant as specific segmental sounds, in 
accordance with the normative written forms, it is expedient, in 
accounting for hybridisms, to treat aspiration and palatalization as 
properties of the word asa whole. Aspiration or palatalization or 
both, prescribed by the normative literary writing, may, in hybrid 
reading, be observed in any place in the word with the following 
rules: aspiration is realized adjacent to a stop consonant wherever it 
might be in the word or as a divider between units of a diphthong; 
palatalization is realized with a sibilant anywhere in the word. The 
word written as bauddha ‘Buddhist’ has the normative literary 
pronunciation [bauddho]. In hybrid reading, however, aspiration 
may be observed with b or d or both or splitting au: thus, [bhauddo], 
or, [bhauddhoa], or, with vowel division, [bhahuddho], [bhahudda], 
or, more frequently [bahuddho]. Likewise, mansapesi ‘muscle’ which 
has the normative pronunciation [maansopee fi] is pronouced in 
hybrid forms as [maay foapeesi] or [maayfopee fi]. Both these 
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words are, of course, pronounced at times strictly in terms of the 
grid, with no aspiration or palatalization so that [bauddo} and 
[maansopeesi] are often heard in reading. Correspondingly, written 
forms bhaudda and manga have been attested. A few more examples 
of phonological and orthographic hybridisms are given below. 


I Hybridism with respect to Aspiration: 


(i) Normative written form: akhanda ‘incessant’ 
Normative pronunciation : [akhanda] 
Colloquial pronunciation: [akanda] 

Hybrid pronunciations: fakandhol, fakhandho] 
Thy Hybrid writing akandha has been attested. 


(ii) Normative written form: apathya ‘disagreeable’ 
Normative pronunciation : [apathyo] 
Colloquial pronunciation : [apattio] 
Hybrid pronunciations : [aphatyo] [aphattio] 
The Hybrid writing aphatya has been attested. 
Il Hybridism with respect to palatalization : 


(i) Normative written form: niraksaya ‘equator’ 

Normative pronunciation: [nirak foyo] 
Colloquial pronunciation: [nirasse] 
Hybrid pronunciations: [niraksaya] [nirassayo] 
The Hybrid writing niraksaya has been attested. 

(ii) Normative written form: visvasa ‘trust’ 
Normative pronuciation : [vi f'vaaso] 
Colloquial pronunciation : [visvaaso] 
Hybrid pronunciation : [visvaa fa] [vi fvaa fa] 
The Hybrid writings visvasa, visvasa have been attested, 

III Hybridism with respect to aspiration and palatalization : 


(i) Normative written form: adharmista ‘wicked’ 
Normative pronunciation: [adharmi/te] 
Colloquial pronunciation : [adurumisto] 
Hybrid pronunciation: [adarmiftho], [adarmistho] 
The Hybrid writing adarmistha has been attested. 
(ii) Normative written form: @$carya ‘miraculous’ 
Normative pronunciation: [aa fcarya] 
Colloquial pronunciation: [aascaria] 
Hybrid pronunciation: [aa fcharyo] [aascharya] 
The Hybrid writing ascharya has been attested. 


Many more examples of such hybridisms have been observed. 
The pronunciation [buddahy saranan gaccahaami] for the Pali 
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buddham saranarn gacchami ‘I take refuge in Buddha’ is often heard 
from Buddhist monks in their discourses. The intrusion of aspiration 
in words which do not have aspiration in their Sanskritic form has 
already been mentioned with reference to examples like Skt. 
pradana ‘giving’, Sinh. pradinaya ‘endowment’, often written and 
pronounced as pradhanaya [predhaanoya] by many people. (It is not 
my intention to give unduly long lists of examples.) 


(b) Matra-bound hybridisms : Another kind of word 
hybridism in Sinhalese is the tendency to write final long vowels as 
short. Thisis done under the influence of the colloquial speech 
in which the permissibility of long vowels in the final syllable is 
conditioned by the miatra** structure of the word. Normative liter- 
ary usage, however, allows both short and long vowels at the end 
of every word structure irrespective of the matra composition of the 
word. 


There are no single-matra words in Sinhalese: monosyllabic 


words are cv, cvc or cvc, but never cv. All open monosyllabic words, 
thus, have along vowel. This applies to both literary and colloquial 
words like ra [ree] ‘night’, /i [lii] ‘wood’ are common to both 
varieties. The contrast under discussion obtains in polysyllabic 
words. In normative Literary Sinhalese, however, long vowels are 
written and pronounced at the end of words irrespective of their 
length in terms of the number of syllables. In Colloquial Sinhalese, 
however, length and shortness are mutually exclusive in certain 
structures, and this depends on the total matra count of the non- 


final syllables. 


In disyllabic words, for instance, the permissibility of a long 
vowel in the final syllable is determined by the matra structure of 


the first syllable as follows: where syllable 1 is cv, and therefore 
one matra, syllable 2 may be one matra or two matras: this allows 
syllables of cv type at the end of such wards. If syllable 1 is cv, 


cve or cvc, and thereby contains two méatras, syllable 2 is never 
cy : open second syllable does not permit a long vowel. Thus, paré 
‘on the road’ kappa ‘cut’ and varta ‘reports’ occurring in the nor- 
mative writing, with the normative pronunciation [paaree, keppuu, 
vaartaa] have the corresponding colloquial pronunciation as [paare, 
keppu, vaarta]. The generalized structures of disyllabic words with 
final open syllables are given below in order to focus attention 
on the difference between Literary and Colloquial Sinhalese : 
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Literary Colloquial 
syll. 1 Spl. 2 syll, 1 syllvé 
cv cv cv cv 
cv cv cv cv 
cy cv 
cve ev cve cv 
CVC cy cvC 


A similar contrast obtains in trisyllabic words. While the 


literary variety has no restriction on the use of cv andc?y in 
the final position, the colloquial variety, owing to the fact that 
the first two syllables add up to more than one matra, only 


allowscv. The permissibility of cy syllables at the end of words 
with four or more syllables is also associated with stress. Quadri- 
syllabic words with one major stress (which in Sinhalese falls on 
syllable 1 or 2) behave like trisyllabic words in that while the 


literary variety allows cv and ci at the end, the colloquial variety 


only allows cy. Where a polysyllabic word is characterised by two 
major stresses, the permissibility of final c¥ in colloquial is deter- 
mined by the nature of the second stress group. Generally, if the 
stresson the penultimate syllable is heavier than that on the 
antepenultimate, the last two syllables constitute a stress group which 
behaves as if it were a disyllabic word for the purpose of allowing 
cy at the end. If the stress on the antepenultimate is heavier than 
on the penultimate, then the last three syllables constitute a stress 
group which behaves as if it were a trisyllabic word. These restri- 
ctions, as mentioned before, do not apply to the normative literary 
usage. 


Because of this difference, and because of the redundant nature 
of final long vowels in Literary Sinhalese when measured with the 
grid variety, there isa tendency among some people to write nor- 
matively final long vowels indiscriminately as short or long and to 
pronounce them thus, thereby creating orthographic and phonological 
hybridisms. yanava ‘going’ written as yanava, maluva ‘fish’ written 


as maluva, vandura ‘monkey’ written as vandura are among such 
hybridisms I have observed. 


(c) _Hybridism with respect to svarabhakti: In accordance with 
the Sanskrit tradition, normative Literary Sinhalese permits word- 
initial consonant clusters involving y, v, rand i. Thus 


eS tyaga ‘gift’ 
§vasa ‘breath’, Kramaya ‘method’, klanta ‘fatigue’, stn 


The written 
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vowel r being pronounced as if it were ru, such words as prthiviya 
‘earth’ may also be taken together with the r- clusters. The colloquial 
speech, in its informal use, allows no clusters of. this type and asa 
result the reading of normatively written clusters tend to involve 
Svarabhakti splits with intrusive vowels. The near-colloquial pro- 
nunciation of the above words is [tiyaago], [suvaaso], [korome], 
[kalanto] and [purutuveiyo]. 


The writing of Sanskritic krara ‘wicked’ as kriira as well as kira; 
vrkaya ‘wolf’ vrkaya as well as vurkaya; vrta ‘circumference’ as vita as 
well as vurta; klantaya ‘dizziness’ as klantaya as well as kalantaya; 
mlana ‘withred’ as mlana as well as malana; nyaya ‘method’ as nyaya as 
Well as niyayaare all cases involving hybridism owing to the above- 
mentioned disparity between the normative literary writing and the 
colloquial speech. The writing of the word for ‘quinine’, normatively 
written as kvinin, as both kvinin and kuyinin may also be regarded 
as a Similar example. 


(d) All polysyallabic words which, in Literary Sinhalese, 
end in -aya have the corresponding form in colloquia] with the 
ending —e (or €) according to the matra structure of the word. Under | 
the influence of this colloquial form, many -aya words are often 
written alternatively with -é: e.g. detané for detanaya ‘breasts’ 
bavé for bhavaya ‘existence’, sagaré for sagaraya ‘ocean’. 


The examples given above have all been attested in recent 
writings. This undiscriminating writing is due to the convergence 
of the two varieties of the language which results in modifying the 
literary normative forms in terms of the colloquial grid. That these 
and other examples are often regarded as wrong is not very relevant 
for the present purpose. What is relevant is the manner in which the 
standard literary pronunciation has broken down resulting in graphic 
changes. Some of the above examples have, ironically enough, been 
taken from works written by purists: even purists seem to allow such 
deviances unconsciously especially in children’s literature. 


(5) Hybridism in Literary Sinhalese (b) Grammar: Although 
the grammar of Colloquial Sinhalese is not used in literary works 
asarule,a few writers have, in a campaign against this compart- 
mentalization of varieties, written some novels entirely in terms of 
colloquial grammar. This innovation, nonetheless, has only a small 
following and the general tendency to keep the two grammars apart 
still persists. Owing to the redundant nature of grammatical 
markers in the literary variety many deviances take place in modern 
writings. Such violations are far too many to ignore as incorrect 
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usage. Treating these deviations from the norm as hybridisms I 
propose in this section to account for the occurrences in literary 
works. The grammatical differences between literary and colloquial 
varieties have been adequately dealt with by Gair and need not be 
repeated here. Gair himself observes this convergence of literary 
and colloquial varieties, when he comments: 


‘If we now turn to look again at current Sinhalese as a whole, 
attempting to classify it exhaustively according to the two 
varieties and their characteristics... there will be a sizeable re- 
mainder that does not fall cleanly into either variety. First of 
all, and least important, there will be the occasional refusal of 
colloquial habits to remain suppressed in an otherwise Literary 
passage. Setting aside such sporadic back-sliding from the- 
artificial conventions of the Literary variety, there remains a 
significant body of material blending the characteristics of both 
varieties in a less haphazard fashion. In much of it, a ccnsistent 
pattern can be discerned.’ (p. 8) 


Commenting on these consistencies, Gair continues, 


‘Main verbs are in their Colloquial form, and the marking of 
other predicators generally accords with Colloquial usage rather 
than with the principle of full predicator marking. However, 
one and usually more of the structural features of Literary will 
be observed with full or relative consistency, so that there may 
be Literary case forms, the observance of the accusative rule, 
specific literary constructions such as the lebenova passive, and 
the use of Literary verb forms in some or all subordinate con- 
structions, Literary functors are also an important and very 
common characteristic.’ (p. 8) 


What Gair considers ‘least important’, namely, ‘the occasional 
refusal of colloquial habits to remain suppressed in an otherwise 
Literary passage’ is too frequent a phenomenon in modern writings 
to be regarded as less significant than any other blend. These, 
indeed, are hybridisms in the actual literary usage, never observed 
in formal colloquial. Gair observes a substantial degree of consi- 
stency in the convergence that produced what he names Formal 
Spoken. The only consistency (if it is a consistency) in this variety 
(or bundle of varieties) is the use of the colloquial finite verb inflection 
when these hybird sentences are used in Formal Speech: when used 
in writing even this degree of consistency disappears. There can be. 
no consistency where a multiplicity choices is available at any given 
functor slot in the Sentence ~ranging from normative literary to 
colloquial, every functor selection is made in a haphazard and 
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unpredictable manner renderiny futile any effort towards making a 
systematic statement of the variety. On these grounds I treat both 
his types on par, giving them the same status that any hybrid variety 
deserves; in both types the possible deviations from the norm are 
exhaustively statable but no criterion for functor selection is predict- 
able. Formal Spoken and deviant Literary are of course two distinct 
types of hybridism insofar as the former employs the colloquial finite 
verb inflection, while all other grammatical markers remain un- 
predictable, and in the latter no marker is predictable although more 
often than not a literary finite verb inflection is employed. 


As an example of Formal Spoken, Gair gives the following 


sentences: vasana ata enu dutu goviyan pritiyen kagasanna patan 
gatta ‘the farmers, who saw Vasana coming from a distance, 
began to shout for joy.’ 


As a spoken sentence this will always retain the finite verb 
form -gatta. But, with the availability of choices at the various 
functor slots, more than adozen sentences are possible of which 
Gair’s example is only one. Notice the possible permutations: 


: enu goviyan kagasanna 
vasana ata dutu ...4 kagasanta patangatta 
enaya goviyo Kagasannata 


If this were strictly a written hybrid sentence, never used in 
formal speech, then there are at least three possible choices at the 
finite verb slot as well. 


patangatta 
patangat ha 
patangattoya 


Where a high variety and a low variety converge, blending 
takes place with respect to both domains of language use, although 
in this paper I focus attention only on the hybridisms in the literary 
usage. As examples of such hybridisms I give below a few examples 
taken from modern texts and explain the blending involved in each. 


1. bhasavaka ratava poliskarayaku nové ‘Pattern of language is not 
a policeman’. 


The normative literary equivalent should read...polis karayek 
nove; the colloquial equivalent is . polis kaarayek nevi. Literary 
grammar makes a distinction between nominative and accusative 
inflections in noun declension which gives poliskarayek, nom., 
and poliskarayaku, acc. The distinction is redundant with 
reference to colloquial, in which the form poliskaarayek equates 
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with both forms in literary. In hybrids, therefore, an indiscri- 
minate use may be made of the two forms. The choice of 
poliskarayaku in this example may be the result of an | attempt 
to give an exclusively literary form of the noun In question. 


.. asavan saha uvamanavan apata sihina magin pene ‘Desires and 
needs become visible to us through dreams. 


The normative literary equivalent should read asdvan saha 
uvamanavo...peneti. The colloquial equivalent is aasaayaluy 
uvamanaayaluy...peenava. As shown in example 1, nouns are 
inflected for nominative and accusative: asavO, nom. pl. asavan, 
acc. pl. uvamanay6, nom. pl. uvamanavan, saha ‘and’ must be 
in the accusative case. The equivalent plural forms in colloquial, 
where nom acc. distinction does not obtain, are aasaaval and 
uvamanaaval. (Repeated -uy is the conjunction). While the 
verb does not inflect for number in colloquial, it does in literary 
so that pené, sg. and peneti, pl. The choice of uvamanavan for 
uvamanavo and of pené for peneti is due to imperfect 
observation of the redundancy characteristics. 


3. sitata pidavak dena siddhin sihi kirimata apa asatutuya 

‘We do not like to recall events that disturb our mind’. The 
normative literary equivalent should read ..api asatutu vemu; 
the colloquial equivalent is... api asatutuy. In literary, the 
first person plural pronoun has two forms, api in the nom. and 
apa in the acc. Colloquial has one form api -ya is the nearest 
literary equivalent of colloquial -y in adjective predication. 
Notice the choice of apa to give extra literary character but the 
use of -ya which is nearer colloquial than vemu, the first person 
plural affix normatively prescribed. 


4. dostaramahatun sanskarakavarun lesapatkarini ‘Doctors were 
appointed as editors’. 


The normative literary equivalent should read dostara mahatvaru 
patkarunu; the colloquial equivalent is dostaramahatvaru.... 
patkeruva. As in the previous examples, the choice of the 
accusative form...... mahatun may be explained as an effort to 
make a literary form different from the colloquial form. The 
use of the singular verb patkarini insted of the required plural 
form patkarunu is due to the absence of a number distinction 
in the colloquial verb,which allows an arbitrary choice. 


(6) Conclusion : The foregoing description of the nature of 
convergence obtaining between Literary and Colloquial Sinhalese 
may not be typical ofall diglossia situation although it is highly 
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orobable that some kind of hybridism may obtain asa result of 
nterference between any two varieties. In diglossic languages like 
Sreek where the same lexical item may have two different meanings 
n the high and low varieties the mount of hybridism may be com- 
aratively small, for the learning ofthe high variety in such languages is 
antamount to learning an entirely different language (although L,-Le 
nterference characteristics are liable to operate in such languages as 
vell). Sinhalese presents a situation different from Greek, etc. 
nsofar as the high-low distinctions are due to superimposed phono- 
ogical and grammatical features which constitute a very different 
limension of redundancy. On account of this superimposed 
edundancy phenomena it is possible to formulate a set of rules by 
he application of which the colloquial variety may be transformed 
nto the literary variety : the vocabulary used in Literary Sinhalese 
loes not contain any item which has a different meaning if it were to 
ye used in the colloquial (subject to the necessary phonological modi- 
ications). The convergence under discussion is the result of the 
mperfect manner in which these superimposed redundancies are 
bserved. That the extra noises heard in listening to a literary text 
o not impede comprehension poses an interesting problem in lingui- 
tic theory which I do not propose to discuss in this paper; 
- has some contribution to make to learning theory as well as to 
uch sociological questions as role acquisition. 


I hope I have demonstrated the value of historical observation 
1 explaining and describing varieties differentiation. The extent of - 
ybrid formation or blending is dependent upon the historical 
ackground to the creation of varieties distinctions in the first 
stance, and although interference between languages as well as 
etween varieties is normally expected, the degree of interference 
nd permissibility of blends between the varieties of a given language 
t a given time is conditioned by the history of the situation. It will 
ave been seen that the general feeling that, where languages have 
iodel targets, spread of education brings low varieties closer to 
1e ideal model, is not entirely true. The emergence of non-nor- 
ative literary grammars discussed ia this paper are indeed a result 
f mass education and the popularisation of literature. That spread 
f education may result in a reversal of the expected cline towards 
le high variety is amply demonstrated in Sinhalese in the efforts of 
‘me educated writers to bridge the gap between the colloquial 
ammar and the grammar of the normative literary variety by 
yandoning the latter altogether and writing in terms of the former: 
have not discussed the impact of this tendency. 
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FOOT NOTES 


For Sinhalese diglossia, see De Silva 1967, Gair 1968. 

See Godakumbura 1955*, Wijeratne 1948*. All discovered 
inscriptions are published in the volumes of Epigraphia 
Zeylanica. 

This is a commentarial work called the dhampiya atuva 
gatapadaya. See Godakumbura [955% 2251, 


See Godakumbura 1955*: Intro, and Hettiaratchi 1950*: Intro. 


For panegyrics and war poems see Godakumbura 1955*%2 270 
Ihave used the standard system of transliteration employed 
in SA languages. 

This ‘translation’ is mine. The transcription is phonemic as 
per Coates and De Silva 1960. 

See Godakumbura 1955*: 9 and Kulasuriya 1960*. 


See Godakumbura 1955*: 56ff., Sannasgala 1947*, Kulasuriya 
1961*. 


In the general election in 1956 which gave a landslide victory to 
the Sri Lanka Freedom Party the SLFP had the full support of 
these people. The subsequent change of government to the 
United National Party was likewise due to the pressure brought 
to bear by them. 


This controversy, which took place in 1862, is reproduced in 
Dharmabandhu 1947*. 


These publications, which are in Sinhalese, are: Prajnarama 


1957*: Intro, Paranavitana 1956-7*, De Silva 1962*, 1963a*, 
1963b*, 1963c*. 


See works cited in note 1]. 


Those cited in note 12 and novelists like S. Gunasinha, 
M. Ratnayaka etc. 


See De Silva 1965*, 1970*. 
See note 15. 


Game Budugunalankarya: bayankaraya ‘frightening’, for Skt. 
bhayankara (ya); liccavi ‘the Licchavi clan’ for Skt. Licchavi,etc. 


Compare the non-Sanskritic alphabet with the phonemicization : 


of Sinhalese proposed in Coates and De Silva 19 
Silva 1963d, 60 and De 


es Ce 
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19. The Sanskrit word matra is generally translated as mora. 


* References marked thus are to be found in the list of references 


immediately following. Others will be found in the General 
Bibliography. 
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International Journal of Dravidian Linguistics 


THE INDIAN LINGUISTIC AREA REVISITED* 


M. B. Emeneau 


India (the Indian sub-continent) is a linguistic area, of the 
Balkan, Caucasian, Southeast Asian type, in which there are 
‘languages belonging to more than one family but showing traits in 
common which are found not to belong to the other members of (at 
least) one of the families’ (Emeneau 1956, p. 16, n. 28). This concept 
of the Indian linguistic area was suggested by several scholars of the 
19th century. A sketch of the history of the concept, with biblio- 
graphy, is given admirably by Kuiper (1967) and by Krishnamurti 
(1969b, pp. 323-5), and here it is necessary to mention only Jules 
Bloch’s work of the 20’s and 30’s of this century, which was in effect 
the stimulus of my work of the 50’s and 60’s. Kuiper’s 1967 article 
very importantly spelled out some of the theoretical and chronological 
implications of the occurrence of retroflex consonants, of gerunds 
(non-finite verb forms), and of the quotative particle iti in the 
Rgveda, including, among other matters, that we must accept 
pre-Dravidian influence upon pre-Indo-Aryan in order to explain 
these full-fledged Rgvedic occurrences. My 1969 paper on onoma- 
topoetics attempted to provide more methodological insights into 
areal linguistics. That of 1971 (actually written in 1968, like the 
1969 paper) reverted to the much mooted problem of borrowed 
vocabulary and its chronology, but also attempted a perhaps 
premature discussion of causative formations (pp. 38-42). 


Little more has been done on this problem in the early 70’s. 
However, a recent paper by Burrow (1971) on “spontaneous cerebrals 
in Sanskrit” shifts the focus on the Indo-Aryan retroflex consonants. 
As part of the painfully slow advances of scholarship in the Iranian 
field, including difficult decipherment of Middle Iranian documents 
and painstaking analysis of content and linguistic history, it has 
slowly come to be realized that numerous Sanskrit items which 
contain retrofiexes have Iranian cognates with dentals. These have 
been published item by item, especially by Sir Harold Bailey. Burrow 
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in his paper sums up these identifications, and with this realization 
that retroflexes of Sanskrit may represent Indo-Iranian and, conse- 
quently, Indo-European dentals he proceeds to further identifications 
of Indo-European etymologies for Sanskrit forms of this kind. 
Satisfactory etymologies are thus attained for long problematic words 
such as guna- (two homonyms), pandita-, etc. 


Neither Bailey nor Burrow goes further than to speak of two 
developments for the old dentals. This leaves us with the old bugbear 
of the neo-grammarians, ‘sporadic’ development, an unexplained 
split in the same phonological environment. I would suggest that the 
bilingual situation provides an explanation. We must postulate that 
Sanskrit was handed down at some early period by a majority of 
Speakers who learned it as a second language, their first language 
being Dravidian. In their first language there were contrasting dentals 
and retroflexes; in Sanskrit, or we had better say pre-Indo-Aryan, 
there were only dentals and some allophones of dentals “backed’’ 
towards the Dravidian retroflex position. Assignment of these 
backed allophones to their own Dravidian retroflexes was easy for 
native Dravidian speakers. Mistakes in the assignment of Sanskrit 
pure dentals to their own retroflexes was also possible. These in the 
iong run could be transmitted as retroflexes, and some of them 
obviously are the ‘‘spontaneous’”’ retroflexes which Burrow identifies. 
To call them ‘‘mistakes’’ may be not much more satisfactory than to 
call them ‘‘spontaneous”’ and “‘sporadic’’, 1 but it does, I think, put 
them in a realistic socio-linguistic setting. 


The hypothesis of India as a linguistic area seems by now to be 
an accepted one. But the evidence for it is still being slowly accumu- 
lated, and in this paper there are presented two pertinent matters. 
The first isan instance of a morpheme with a complex semantic 
structure; the two language families have the same semantics, but 
quite different phonological representations. The second matter is 
the socio-linguistic one of the caste structure and its representation 
in language. 

(1) Sanskrit api: Dravidian*-um 

Sanskrit api has some uses which are inherited from the 
Indo-European state of affairs for *epi (and *opi), and are quite 
parallel with the uses of Greek epi, viz. verbal prefix (chiefly Vedic), 
and with a noun case-form (rare, Vedic). Its use in Sanskrit as an 
interrogative particle beginning a sentence seems to be unique. The 
group of usages which are the point of the present study belong to 
it as an enclitic particle (i. e. enclitic almost without exception), and, 
though one can see an Indo-European origin for these usages, their 
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development is unique and proper to Sanskrit alone. Whitney’s 
definition of apias a verbal prefix is ‘unto, close upon or on’ 
(1889, p. 396). Periphrasis of this as ‘close connection with’ seems 
possible. This would also seem a possible basic meaning for the 
following usages of the enclitic particle. 


Usage 1: ‘also’, i. e. this as well as, in close connection with, 


the previously stated noun, non-finite verb, predication, etc. 


in series, usually but not always in combination 
he non-enclitic api ca, both ‘and also’. 
ed construction in competition 


Usage 2: ‘and’; 
with ca, i. e. ca ‘pi, but also t 
This is not avery strongly develop 
with the enclitic ca ‘and’. 


Usage 3: in concessive constructions, ‘even’, either with yadi 
‘If? (yady api ‘even if, even though’, often followed by the correlative 
tatha ‘pi ‘even so’), OF following a single word or phrase. In either 
type of construction the meaning is: ‘if X is/was added to the 
situation, still Y, the opposite of what is to be expected as the 
result of X, will be/was the case’ (X may be positive or negative, as 
may Y independently of X); exx. ‘even though he comes/came, I will 
do/did what I plannened’, etc.; ‘even though a boy (Sanskrit: balo 
‘pi), he had the characteristics of an adult.’ 


These three usages (1 to 3) are ‘additive’, adding items to 
achieve a totality, in usage 3 a total, but unexpected, situation. 


Usage 4 is explicitly a ‘totalizing’ or ‘summing’ one: following 
a numeral or numeral phrase, api indicates that all members of the 
numbered group, without exception, share in the statement; it is 
sometimes redundantly used with sarva- ‘all’, sakala- and SSS 
‘all, the whole’; sometimes it occurs with adverbs such as sada 
and nityam ‘always’. E.g. dvav api ‘the two of them, both’ 
sarve ‘pi, ‘all of them, without exception’, sakala ‘pi ‘the whole of ie 
This summation is found in references back toa group once it has 
been defined, or in reference to a ‘natural’ group (the two 
all the gods, the directions, etc.). c. 


Usage 5 may be interpreted as an extension of 4: api with an 

interrogative pronominal form or derivative produces an indefinit 

phrase; E.g, ko ‘pi ‘whoever, someone, anyone’. Th +e 

development is ‘all who ...’. ie 

tions is (according to Renou 196], p. 378, § 262): interrogati 

or cana is Vedic (usage with ca is inherited cian ae tanorckn: 
> 


of. Latin quisque), interrogative + api classical, beginning with 


The chronology of indefinite construc- . 


eg ee 
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Manu, the grammarian Patajijali, Kautilya; i. e. api as a connective, 
synonymous with ca in usage 2, replaces earlier ca in this construc- 
tion, asa result of their partial synonymy. It should be noted 
too that there is competition between ca, api, and also the old cid, 
in this construction in the classical language. 


Of these five usages of api, usages 1 and2 are Vedic and 
Classical, usage 3 is Vedic and classical, though yady api ‘even if’ is 
not found before the brahmaya texts, usages 4 and 5 are classical. 
Usage 4 needs a little more chronological demonstration than is 
found explicitly in the reference books.2 Speyer’s earliest occurrence 
(1896, p. 71, §232a) is Manu 1,16 tesam sanndm api ‘of all these six’. 
Renou (1961, p. 517, § 382) says ‘usage proprement cl[assique]’ and 
considers, unnecessarily, that api with ubhau ‘both’ and sarve ‘all’ 
is used ‘expletivement’; ‘redundant’is the just word here. Apte’s 
example (1898, p. 171, § 247), sarvair api ‘by everyone (not excluding 
even one) ’,is fromthe simplicior version of the Paficatantra,3 not 
necessarily very early in the classical language. The great lexicon 
of Béhtlingk and Roth (1852-75) gives references to Maau, 
Ramayana, and later authors. One should add examples from the 
Mahabharata, to show that the epic language in general knew the 
usage; e. g. 1.225.19 sarve trayo ‘pi ‘all three of them’. Several 
occurrences can be quoted from the Bhagavadgita: 1.27 senayor 
ubhayor api ‘in both the armies’, 2.16 ubhayor api...anayos ‘of both 
these’, 13.19 ubhay api ‘both of them (accusative)’, 4.30 sarve ‘py ete 
‘all these’. A passage from ASvaghosa’s Saundarananda (Johnston 
1928; 10.19cd) shows its use with samasta-: citram samastam api ke 
cid anye sanndm rtinam sriyam udvaheanti ‘some other (trees) bear 
the glorious beauty of the six seasons, al/ at one time’. The 
Paficatantra reconstruction by Edgerton (1924) is perhapsto be con- 
sidered as early as the epic. Two examples have been found that 
must derive from the original text; both are in verse, and both have 
ubh4v api ‘both of them’ (p. 123, book 1, vs. 123; p. 302, book 3, 
vs 46). Otherwise, there are numerous examples in this text; e. g, 
p. 250, book 2, sent. 185 trayo ‘pi te ‘all three of them’, p. -270, 
book 2, sent. 247 catvaro ‘pi te sarve ‘all four of them’ (redundancy 
of api with sarve), p. 292, book 3, sent. 49 astasv api diksu ‘in all 
eight directions’; unfortunately, none of these latter is guaranteed to 
be the original wording of the text, but they are close to it. 


All these usages continue into Middle Indo-Aryan. For Pali, 
one may refer especially to the very full article in the Copenhagen 
dictionary (Trenckner 1924—), s. v. api. Usage 5 is rare, in com- 
petition with ci < Sanskrit cid. Usage 4 again should be especially 
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referred to (vol. 1, p. 290, Ist col., sect. ©, and also in Andersen 
1907, s. v. pi and under the numerals; there are some examples from 
the canonical texts); e. g. ubho pi ‘both (of us)’, amhehi dvihi pi 
‘with us two’, pance pi upadanakkhandha ‘the five skandhas which 
are the basis of clinging to exisence’, to cattaraspi pandita ‘those 
four wise ones’, sakalo pi lokasatnivaso ‘the whole society of men’. 
In Prakrit the usages continues; cf. for usage 5 kovi, kavi, kimpi. 
Usage 4 is exemplified in Pischel (1900) in the paragraphs on the 
numerals (pp. 309 ff., $436 ff.); E. g. dohi vi bahahim ‘with both 
arms’, Several examples from the Kalakacaryakatha (Brown 1933) 
are: p. 40, line 29 pancdnauim pi rayano ‘all ninety five kings’, line 
32 savvesim pi ‘to all of them’. 


This OIA and MIA semantic structure and its unitary repre- 
sentation have not survived straightforwardly in Modern Indo-Aryan 
(NIA). In all of the languages usage 2 is represented by forms 


different, with very few exceptions, from those of the other usages. 


For the rest of the usages (1, 3, 4,5) avery few languages can be 
thought to have retained the older semantic system; at least, the 
forms have retained identity, even though it is by no means clear 
that the modern forms are genetic developments from the older ones. 
But in most of the languages some degree of disintegration of the 
system has taken place. In the surface structure different forms 
represent the different usages toa greater or less degree. Whether 
one can say that the semantic structure survives, becomes a very 
moot problem, which is discussed below. 


Finding the relevant forms in the available NIA material is not 
very easy. Recording and analysis are often defective, especially for 
usage 4. Tabulation for a few languages follows (in this and in what 
follows vowel quantities fluctuate, without my being able to interpret, 


and, I suspect, without there being any explanation except lack of a 
standardized spelling): 


Usage 1 2 3 4 a 
Marathi hi ani -hi, -i —hi, -i ~hi 
. bi va ~ph- 

Maithili -o, -hu 0, do -O, -ao -o, -hi =T =u 
aor, ar 

Braj hi, haa = aru 2 -ha, -hi hé, hi 

ce ~U, -0 0 2 Sa 

Hindi-Urdu bhi aur bhi -~6 -i (and 

bhi 


other forms) 


=< lUlUCmS 
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In usage 2 (‘and’) the origin of some of the forms is clear. It 
is to be noted first that the commonest form in Sanskrit, ca (CDIAL 
4533), seems to have disappeared with hardly a trace, except in 
Sinhalese (OSi. -d... -d, Si. -t... -t). The reason is not far to 
seek. Sanskrit ca became in some Prakrits a (written ya in certain 
contexts in some Prakrits), which, if an enclitic, would phonologi- 
cally have been evanescent in most contexts in the NIA languages 
descendant from those Prakrits. Inconvenient loss of the morph 
was made good from other possibilities in Sanskrit and MIA. 
Sanskrit aparam ‘further, besides, moreover’ (CDJAL 434.3), MIA 
avaram, yielded numerous forms withr: Hindi aur, Oriya aur, Old 
Maithili (Chatterji and Misra 1940, p.6) adoro; with metathesis, Old 
Braj aru, Nepali aru, Assamese aru; with various losses, Bengali 4r, 
Panjabi ar, Nepali ra. Some form of anya- ‘other’ (? anyat ‘another 
thing’, anyac ca ‘and another thing’, anyani ‘other things’; CDIAL 
399), through MIA an(n)a-, yielded Marathi ani, Gujarati ane, 
n¢, Oriya ane, Nepali ani. All these forms connect coordinate items: 
A aur B; the same construction is found in the following. This 
class of forms includes: Hindi au, Oriya au, Sindhi ai, Old Maithili 
(Jha 1958, p. 567) 0, ao, 40, Bengali 0, Assamese u, Khowar 0, 
Gypsy (various dialects) 6, a, 0. On these there have been divergent 
opinions. Turner includes them with the r forms (CDIJAL 434.3), 
presumably merely with loss of r from forms of the type *aur; he is 
following Beames (1879, p. 270, $83). But he is uncertain and 
suggests (as less likely) the possibility of derivation from Sanskrit 
uta (CDIAL '700) for some of these items, viz. those with forms 
0, 6, a (Assamese, Khowar, Gypsy; he does not mention Bengali), 
deriving them through Prakrit ua, o. On the other hand, Hoernle 
(1880, p. 395, § 559) thinks (‘probably’) of ‘Pr[akrit] avi, Skr. api 
or apica’. Just how the derivation proceeds is not certain, but since 
the connective comes between the coordinate items, the enclitic api 
would seem impossible and only non-enclitic api ca can be considered. 
Then Prakrit avia (<Sanskrit api ca) would be the intermediate 
stage, with loss of final vowels. Turner (CDIAL 486) takes api ca 
‘also’ to be the original of the alternative Sindhi form ai, which 
Beames gives as 41 and takes to be merely secondary to ai. Marathi 
va ‘and’ should also be quoted (perhaps also Pashai vo ‘also’). 
Turner fails to give the Marathi form, but derives the Pashai form 
(CDIAL 485) from api. It would seem, prima facie, that Marathi 
va, which comes between the coordinate items, should be derived 
from Prakrit via, i.e. Sanskrit api ca. These au/o forms remain 
problematical. I should be inclined, unless there are phonological 
reasons unknown to me, to follow Hoernle in deriving this group 
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ed only be noted again that in 
Sanskrit usage 2 is not very strongly developed, but does exist. On 


the other hand, it will be found below that usages 1, 3, and 4 are 


represented in some languages by o/u and a0; this may make 
origination of 2 from the same sources more plausible phonologically. 


and Marathi va from api ca. It ne 


Finally, in Hindi Urdu ‘also’ form bhi is used in non-standard 
or non-normative speech in usage 2 (‘and’); it is enclitic as is 
Sanskrit ca or api, and the structure is X bhi Y bhi (e.g. ‘we are 
all going, I bhi, my children bhi’). So I am informed by Hindi-urdu 
speakers, as well as by foreign observers of everyday speech in the 
streets. It will be noted below that Gondi borrows Hindi-Urdu bhi 
in usage 1 and also in this usage 2. §.K. Chatterji (1942, p. 225) 
quoted a verse which was composed by the Urdu poet Sir Muhammad 
Iqbal in a self-consciously Hindiized form of his poetic language; 
it begins: sakti bhi santi bhi ‘strength and peace’. Chatterji’s 
remark that this was written ‘in a spirit of condescension, one would 
suspect’, probably refers to this non-standard use of bhi. 


Once usage 2 is thus practically eliminated from the semantic 
structure, it is found that in some of the languages the remainder of the 
structure has not suffered disintegration (whatever the origin of the 
particular forms may be). E. g. Marathi has hi (? hi) for usages 1, 4, 
5, and for 3-i ( jar-i. tar-i) which in Old Marathi was in free variation 


with- hi. In usage 1 bi also is used, and in usage 4 there is variation 
for the numerals ‘two’ ‘three’ and ‘four’ with the forms dogh-, tigh-, 
caugh-(masc. -e, fem. -i, neuter -é); but this variation does not 
invalidate the semantic unity that we have found.4 


A situation similar to that of Marathi seems to be found in 
the early Braj Bhasa prose, as represented in the language of Indrajit 
of Orcha (McGregor 1968). Unfortunately, usage 3 (concessive) 
does not occur in the texts, except with the Sanskrit yady api. 
However, usages 1, 4, 5 are all apparently represented by the same 
form, written variously hi, ha, a. 0 ( $§3.55, 95, 228, 231-2, 261-6). 
it is suggested (§ 2.30) that h ‘Shas a tendency to disappear hetween 
vowels... first after long vowels’’, and this form, in usage 1 at 
least, is singled out as showing the loss very commonly. 


Maithili, as presented by Jha 1958, clearly has the same forms 


(-o, -hi) for usages | and 4, and possibly even for 3 (jaio=Sanskrit - 


yady api) and 5 (jehd ‘whatever’) (88590, 782). Bhojpuri, in Tiwari’s 
account (i960), has -o in usage 1 and -6, -ii in usage 4 (§ 419) and 
even usage 5 (§ 464); I have been unable to find usage 3. 
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Perhaps the clearest instance, superficially at least, of disinte- 
gration of this system is seenin modern Hindi-Urdu, for which I 
refer to Kellogg 1938. Here usage | is represented by the form bhi 
(660); note also the occurrence of bhi in usage 2 mentioned above: 
Usage 3 (concessive) is in literary style represented by the Sanskrit 
yady api; but colloquial usage has jo bhi, with bhi sometimes com- 
pletely separated from jo ($664: jo dp mujhe tyag bhi karé ‘even 
though you should forsake me’; for jo ‘when, if’, see Turner, 
CDIAL 10401). Usage 4 is represented by -6 (§223). Usage 5 is 
represented by -i (aad other forms). It must be noted, however, 
that even this sketch of usage (with bhi in usages 2 and 3) does not 
exhaust the non-normative use of bhi. Ramchandra Varma (1966, 
pp. 195-6) notes that in usage 5 bhi is added to the indefinite pro- 


nominal forms (E. g. koi bhi, kahi bhi, etc.), but teaches that this 
is inelegant and to be avoided. 


Historical derivation of the whole system of forms presents 
difficulties in almost all details. The forms of usage 1 (‘also’) one 
would expect to be derived from Sanskrit api. In fact, those NIA 
forms with initial labial (v, p, b, bh), e. g. Hindi-Urdu bhi, Marathi 
bi, Sindhi pi, bi, Oriya bi, Gypsy vi (Turner, CD/AL 485), would 
seem to present little difficulty, considering that Prakrit has, besides 
avi, the forms pi, vi, without the initial vowel; Bloch 1915 (pp. 131 
f., § 127) has found explanation for p > b in apocopated initial 
positions. The difficulty that emerges here is the aspiration in 
Hindi-Urdu. 

This aspiration can hardly be considered apart from the h that 
occurs in many other forms which show no labial. One may instance 
Marathi hi in usages 1, 4, 5, and even 3 (Old Marathi; see above). 
Here h is found in older stages of the language and shows some 
instability even there, e. (Master 1964, p. 102, §190) pamecahi/ 
pamcai, and in usage 5 -hi becomes -i in modern Marathi (e. g. 
konhi ‘some one’ > koni). One may instance also the Braj set 
which has ha and & alternating in usages 1, 4,5, and the indefinite 
pronoun forms (usage 5) given by Chatterji 1926 (p. 845, § 588) 
with and without h (e. g. Bengali kéo, kéh6, Bhojpuriya kéha, kéa), 
as well as other forms. 

This h has been generally assumed to be somehow derived 
from one of the ‘emphatic particles’, viz. MIA hu (whose derivation 
is usually taken io be: Sanskrit khalu > Prakrit khu > Apabhramésa 
hu), or Sanskrit and MIA hi. Both hu and hi are mentioned in 
Chatterji 1926, p. 845, §§588, 589, in discussing usage 5. McGregor 
(1968, p. 200, §3. 228), in dealing with Braj, takes hi/ha@ in usage 1 to 


ijdl Vol. IIT No. I 100 Emeneau 


be derived from Sanskrit khalu, i.e. ApabhramSsa hu, but he adds: 
“(with api?)’’; he is apparently following Baburam Saksena (Evolution 
of Awadhi, §377, which is not available to me). On _usage 4 (P. 208, 
§3.261) he gives this (but only alternatively) as a derivation, without 
mentioning api, following Tiwari 1960 (p. 124, $419), who is following 
Chatterji and Misra 1940 (p. liii, $40); Jha 1958 (pp. 577 f., $782) 
gives the same explanation. Hindi-Urdu bhi has been taken by 
Beames 1879 (p. 271) and by Kellogg 1938 (pp. 389 f 660} to be 
derived from Sanskrit api hi (Kellogg adding Prakrit bimhi), i e. 
with addition of the ‘emphatic particle’ hi; the implicit explanation 
here would involve anticipation of the voiced aspirate in the initial 
labial (‘metathesis’), This last combination, however, Sanskrit api 
hi, as well as Prakrit bimhi, seems to be nothing but an ad hoc 
invention to explain bhi, not really found in the texts. 


On the other hand, it has, I think, not so far been noticed 
that Apabhraméa does have instances of vihu and pi hu in usages 
1, 3,4, 5. This at least is the situation in the Bhavisattakaha edited 
by Jacobi 1918 and the Sanatkumaracaritam edited by Jacobi 1921.5 
Jacobi 1921 (p. 150,s.v. hu) says that hu occurs only in this 
construction. The former text has only taha vi hu (=Sanskrit tatha 
‘pi), but the latter has examples of all usage 1: 638.3, 710.5, 754.6, 
766.6, 778.1, XXII, some of these even approaching usage 2 ‘and’: 
usage 3: 766.3; usage 4: 711.1 [d6 vi hu = Sanskrit dvav api] 625.1 
[saya vihu = Sanskrit sada ‘pi], XIV [niccam pi hu = Sanskrit 
nityam api]; usage 5: 507.6[k6 vi hu = Sanskrit ko ‘pi], 780.9 [ké 
vi hu = Sanskrit ke ‘pi]). This being so, we may be sure, I think, 
that Hindi-Urdu bhi is derived, not from an invented Sanskrit api 
hi, but from the actually occurring Apabhramsa p/vi hu (even 
though there seems to be no occurrence of this combination in IA 
before Apabhramsa, except perhaps for Prakrit occurrences of 
taha vihu). We should invoke the anticipation of voiced aspira- 


tion in the initial labial, that we found implicit in the Beames- 
Kellogg explanation. 


Having achieved this derivation, we are warranteed, [I think, 
In assuming the same derivation for all other forms with the aspirate. 
E. g. the Marathi hi forms (and those i forms derived therefrom with 


loss of h) are so derivable; the vowel in hi is that of api, and u of hu 
has been lost. 


| The ha forms of Braj (and those of Maithili also) one would . 
like to derive in the same way. Certainly, h ought to be derived in 
the same way. The problem here in the vowel i, and the vowels o 


and ao found inthe Maithili alternative forms (-o, -ao, -hi). In 
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fact, u and o seem to be interchangeable in this. context, viz. as short 
and long respectively of the same vowel quality. So at least 
Chatterji 1926 regards them in his treatment of the indefinite pro- 
nouns (§§$588-90); kéi, kéo, kéhi, kéh6 and quoted indiscriminately 
as if they were interchangeable. Chatterji finds it possible to derive 


-6 from the -v- resulting from Sanskrit api > Prakrit vi, and -ho 


_- 


from this -d ‘‘with influence from the emphatic particle hu, ho’. 
Can we simplify all this by positing that in the enclitic (i. e. 
accentless) sequences avi and avi hu, the vowels are lost variously, 
v may be vocalized to o/u (or possibly we may think of av > 0), and 
h may be moved to be beginning of the combination (and then ina 
later stage of the language lost)? Or, in the matter of the position 
of h, may we think of voiced aspiration as being a type of syllabic 
production aud not really susceptible of linear positioning in the 


Statement? This would yield variously -o and —-ao from avi, —hi 
(> 0) and hi (> -i; Marathi) from avi hu. 


Are we now in a position to find that Hindi-Urdu -6 (usage 
4 with numerals) can be fitted into this type of derivation, as we 
should like to do considering that it represents semantically usage 4 
of Sanskrit api? Certainly it is equatable with Maithili -o, Braj -d, 


-f, -d, and we may note that Kellogg (1938, p. 152, §224) quotes 
another type of Braj as having -aii and Maithili as having -6 as 
well as -o (as given by Jha 1958, pp. 577f., § 782). Moreover, 


Braj has forms with —ha, -hi, and is said to have also -ahi. I. Sy. it 
is tempting to consider Hindi-Urdu -6 as being derivable, like 
Maithili -o (in usages 1, 3, 4) from Prakrit avi, with loss of both 
vowels, and vocalization of v. Or, -ahti may yield -6 through the 
intermediate -at; loss of h has been clear enough in some of the 
forms discussed above, and is seen in other Hindi-Urdu develop- 
ments. Certainly, Kellogg (1938, pp. 165-6, §250) and McGregor 
(1968, p. 208, § 3.261; alternative explanation) can hardly be followed 
in deriving Hindi -Urdu -6 and related Braj forms from Sanskrit 
~ayam in such forms as vimsSatayam ‘group of twenty’ (> Hindi 
-~Urdu bis6); the phonolgy is possible (to the extent at least of: 
—ayam > Braj -ai > Hindi-Urdu -6), but such Sanskrit forms 
(they are rare) do not really occur in the relevant semantic usage. 
The textbook statement (also Hoernle 1880, p. 272, § 424) that the 
Hindi Urdu -6 form is, or is identical with, the oblique plural 
explains nothing. 
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We may now revert to the ‘and’ forms (usage 2) of the type 
Hindi au, Bengalio, Assamese u, etc. If we save found a tenable 
explanation above for the -o/u, -ao forms, 1t1S at least possible that 
the same explanation will hold forms of usage 2, even though api ca 
(and not api) would have to be assumed as the source type (Prakrit 
avia, via). 

Our hypothesis will hardly solve all details. E. g. can Nepali 
-ai in usage 4 be derived from Sanskrit api? It seems improbable that 
it can be that simple. Again, the Marathi stems of usage 4, dogh- 
‘two’, tigh- ‘three’, caugh- ‘four’, have voiced aspirates, but their 
velar position of articulation seems to defy explanation within our 
hypothesis. Master's suggestion (1964, p. 103 top), Prakrit donh-> 
*domgh- > Marathi dogh-, seems impossible, even in terms of 
Pischel 1900, $267; Katre (1966, p. 132, §235) may well have found 
the explanation ina compound with Sanskrit graha— ‘extent? (> MIA 


*_(g) ghaa-). 


Moreover, not all representations of usage 5 (indefinite 
pronouns) have been taken into account, and there is difference of 
opinion whether api is indeed the origin of the indefinitivizing suffixal 
material, apart from those instances already mentioned (Marathi 
-hi/ -i, Braj -hi, and those forms with —ha treated by Chatterji). 
Chatterji’s taeatment is typical (1926, p. 845, §$588-9; on the whole 
he follows Beames 1875, pp. 326-8, §72): the h-less forms, of which 
typical examples are Hindi-Urdu koi ‘some one, anyone’ (oblique 


kisi), and the Magadhan forms, E, g. Bengali kéu, Bhojpuri kéu, 
Assamese kéo, he derives from interrogative masculine singular 
nominative kas plus api. Hindi -Urdu koi, then, is straight forwardly 
derived by him from Sanskrit ko ‘pi, Prakrit kovi, with loss of MIA 
v. The Magadhan forms show the vowel e which is the Magadhan 
MIA development of the masculine singular nominative -as; the final 
—u/o are derived (as above) from the v of MIA avi (with loss of final 
vowel i). On the other hand, for the neuter forms (‘something’) 
Bengali and Maithili kichu, Braj kacha, Hindi-Urdu kuch, derivation 
is taken to be from Sanskrit kimcid (i.e. with the competing cid 
instead of api), final -u is supposed to represent hu, which is parallel 
to hu following vi. The problem of aspirated ch rather than un- 
aspirated c is solved by supposing spread from a derivative of kas cid 
‘some one’ in which Sanskrit éc yields MIA cch. The Asokan 
inscriptions notably provide evidence which makes this explanation 
plausible, viz. nominative singular masculine kechi (a ‘Magadhism’ at 
Mansehra; remade from *kacchi) and nominative singular masculine 
keccha (Dhauli, and presumably kechf{a] at Kalsi; remade from 


; 
t 
J 
5 
f 


ee ee ee ee ee 
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*kaccha <Sanskrit kag ca) (Bloch 1950, p. 69, §32, and index, p. 185). 
Turner, however, though he agrees with Chatterji in his interpretation 
of the neuter forms with ch (CDIAL 3144 kirncid and 2967 kascid; 
he says nothing about the origin of -u) and in his derivation of 
Bengali keu ‘anyone’ from Sanskrit ko ‘pi, Prakrit kovi (CDIAL 3518 
ko ‘pi), yet derives all other forms, Hindi koi, Oriya kei, Bengali 
kei, etc., from Sanskrit kag cid (CDIAL 2967 kdscid) > Prakrit koi, 
undoubtedly basing his view on Pali koci and the usual Agokan and 
Niya forms in -ci, and assuming that intervocalic v (< Sanskrit p) 
Should not be totally lost. This is probably correct. However, I find 
puzzling his inclusion in CD/AL 2967 (kagcid) of Assamese keo, 
Maithili keo, Old Awadhi keu, rather than with Bengali keu in CDIAL 
3518 (k6 ‘pi); I think that itis only possible to conclude that there 
has been inadvertence in this inclusion. 


Earlier it was said that NIA shows a greater or less degree of 
disintegration of the semantic structure represented by Sanskrit api 
and the MIA derivatives. Clearly the slightly developed usage 2 
(‘and’) has been replaced frequently by other forms than derivatives 
of api, with the possible exception of the au/o/u forms. Otherwise, 
Marathi shows least disintegration, since it has hi in usages 1, 3, 4, 5, | 
with few exceptions (e. g. -gh_ in usage 4); we may say, I think with 
confidence, that apart from usage 2 Marathi has retained the old 
structure, whatever the origin of its form hi (in all probability MIA 
p/vihu). We have found a similar state of affairs in Braj and in 
Maithili and Bhojpuri; the details are less clear than in Marathi, 
since the descriptions present so many variants in the written texts and 
the derivations are perhaps less clear. In modern Hindi-Urdu the 
picture at first sight seems like almost total disintegration: in 
normative Hindi usage | has bhi, usage 3 has Sanskrit yady api, usage 
4 has- 6, and usage 5 has -i and other forms. For usage 5 (indefinite 
pronouns) we have found reason to think, following Turner, that in 
fact the Sanskrit element cid has won out in competition with api. 
In fact, however, we have found that in non-normative Hindi-Urdu 
usage 5 is supplemented by addition of bhi, and even in usage 2 bhi 
is common. The form bhi in all its usages (1, 2, 3, and 5) and -6 in 
usage 4 have as origins respectively Apabhraméa p/vi hu and either 
this or merely older api. We have been tempted to treat api and 
Apabhram$éa p/vi hu as interchangeable and equivalent in the 
development (but essentially we are without a firm solution). If this 
were allowed, all the Hindi-Urdu usages would have the same origin, 
Can we treat the forms bhi and -6 as allomorphs of one morpheme 
and say that the semantic structure has to this extent not disintegrated? 
Can we go further and say that the derivative results of competing 
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forms in usage 5 (Sanskrit cid and api) may be treated as the same 
morpheme as that in the other usages? In other words, does 
Hindi-Urdu, in spite of its divergent phonological developments in 
some usages, still preserve the old semantic stricture? . ie.) 1t 1s 
possible, if such methodology is valid, to say that the old semantic 
structure has been preserved, no matter what the surface structure in 
the derivative languages. I am not sure what a native speaker’s 
intuition would let him say if he were asked these questions. Certainly, 
no Hindi-Urdu speaker seems yet to have asked himself the questions 
explicitly. Let us hope that some one may be stimulated to the 
inquiry by my outrageously premature treatment. 


But let us leave Indo-Aryan and look at Dravidian. 


In Tamil -um has the same five usages that Sanskrit has for api. 
We may refer to Arden (1934, pp. 128-9, §$240-1) for modern use, 
and that all these usages are old guaranteed by the treatment in 
Tolkappiyam (Collatikaram, sitras 33, 255, etc.; for usage 3 
[concessive] an example is given from a Sangam text to illustrate sdtra 
256: kanan kar enakkairinum ‘even though the forest proclaims that 
it is the rainy season’). In usage 2-um follows each of the co-ordinate 
forms (X-um Y -um). Forms which end with the ‘enunciative’ 
vowel (Bright 1972, p. 28: ‘‘zero in the underlying phonological 
structure’) add *-um to the basic form; this is true in Tamil, Mala- 
yalam, Kannada, Kodagu, Telugu. It should be noted that ella ‘all? 
is often ‘totalized’ with -m, and its derived forms with -um: ellam 
‘all (animals or things)’, ellarum ‘all (persons)? (with case forms, 
dative ellarukkum, etc.). 


The other SDr. languages have quite closely parallel systems. 
Malayalam is not different from Tamil (Frohnmeyer 1913). 


Toda has derivatives of *-um in all five usages. The general 
form is -m (historically with loss of the short vowel in a non-initia] 
syllable); E. g.i. nym ‘now also’, a:skm ‘for the house also’, pism 
obm ‘a bow and an arrow’. However, usage 3 (concessive) has some 
still unexplained variation between -m and -um; e.g.kwa'r fodnum 
‘even if the monsoon comes’, kwirtenum ‘though I give’, pi'nm ‘even 
if he went’. Usage 5 has generally -um at the end of a subordinate 
predication and -em in forms that are not final in predicate, although 
-m also occurs occasionally in both positions, especially in song 
phrases; E. g. oryem nink oj godti. ‘everyone fearing will give to 
you’, on etm bi. xiseni. ‘I cannot gO anywhere’, in uncnum ‘what- 
ever he thinks’, e. 1 igyum ‘wherever you will live’, Usage 4 has -m: 
E. g.ed xoykm ‘to both hands’, ifot wid eskm ‘to all twenty one 
Jeaves’, ir acdm ‘both these buffaloes’; when the numerating word is 
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el ‘all’, -m is found in, E. g. a9 elm ‘all that’, but when the initial 
element before el is a personal pronoun, the suffix is 70m, ¢.:2. nim 
elom ‘all of you’.6 


Kannada in the older stages has in all five usages -iim, in the 
modern language -a, i. e. with loss of -m and presumably compen- 
satory lengthening (Kittel 1903, §§284, 287, 290-4). 


Kodagu (like modern Kannada) has -u. in all five uSageS> 
derived from *-um (Emeneau 1970, p. 152). It combines with a 
preceding vowel -a to yield -o. (E. g. mara ‘tree’ > maro., parandaka 
‘when (one) talks’ > parandako, na. da ‘of me’ > na. do.). It is 
clear that in the oblique cases ella. ‘al]’ has this suffix (ella. ku. ‘to 
all’, ella, no. ‘all [accusative]’, ella. ru. ‘all people’), and, since 
final *-m is Jost in this language, it is probable that ella. in sucha 
phrase as mu. ndi kufifii ella. ‘all the three children’ represents 
*ella-m, i.e. that *ella and *ellam have coalesced into one form. 
For some further examples in this language, see Emeneau 1970, p. 152. 


Kota, alone of the SDr. languages, shows some disintegration 
of the structure. Usages 2 and 4 retain *-um in the form —m; in 
usage 2 (‘and’) either each of the coordinate forms has -m (X-m 
Y-m), or the first occurrence in a pair or both occurrences add to -m 
-i., which I suggest is an emphatic particle. For usages 1 (‘also’), 3 
(concessive; E.g., a.m ketme.ldan ‘even if [we] are ruined’), 5 
(indefinite; e. g., eyk o.nme.ldan ‘wherever [we] go’), Kota replaces 
-m with -dan.? The semantic structure is kept in two sections, and 
it is possible to suggest the same kind of solution, with two allomo- 
rphs of one morpheme, that was suggested for NIA above; it is to be 
noted that the two forms are mutually exclusive. This is the situation 
in the dialect that I studied. Another dialect, that studied by Tiru 
G. Subbiah, has -m alternating with -dan in the usages that I 
recorded with -dan. However, in one passage in Kota Texts 
(Emenzau 1946, vol. 2, p. 87, sent. 437 and note 46) a proverbial 
saying with -mi. ... -mi. ‘and’ was explained by quoting the proverb 
with substitution of -dan ... -dan, as if my informant knew such 
alternation for usage 2: culk murcvo. nk, od pa. Imi-. ke (k) kecvo.nk, 
od pa‘ Imi*. ‘To the man who broke the stick, one share, and to the 
man who did the work, one share.’ culk murcvo‘nkdan, od pa‘lm. 
ka'Imr urtvo ‘nkdan, od pa'lm. ‘For the man who broke the stick, 
one share, and for the man who plowed the field, one share.’ (pa. Imi, 
< pa‘lm ‘share’ + -m-i*.) 


In Telugu usages 1, 2, 4, and 5 are represented by the same 
forms, either lengthening of the final vowel of the form, or a suffix 
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-unnu (with several phonologically conditioned allomorphs, E. g. 


-nnu after i, -yunni after a, 4, ore); Arden 1937 gives the usages 
§§ 217-9, 237, 491, 591. Usage 3 (concessive) generally has a suffix 
-ina, which continues the old Telugu conditional -inaN, but even its 
long vowel is parallel to the lengthened vowels of the other usages; 
however, sometimes this usage is represented by -nni following a 
non-finite verb form ending in i, or by lengthening of this Jatter i. 
I. e. the semantic structure is found here in a CDr. language, as it is 
in SDr. in general. The origin of both types of modern forms from 
*-ym is made clear by the material given by Mahadva Sastri 1969, 
pp. 259-62. In the oldest material-um alternates with -un; the 
various later alternative forms result from (1) loss of the nasal with 
compensatory lengthening (as in Kannada and Kodagu), or (2) 
addition of the enunciative vowel (whence -unu, -unnu, and other 
forms). 


Two of the NDr. languages have traces of the suffix *-um and 
remnants of the semantic structure. 


Kurukh has a suffix with alternants -im added to words ending 
in a consonant, -m added to words ending ina vowel. A form in 
-um is also recorded alternating with one ending in -im, _ viz. 
(Grignard 1924a, p. 254, 1924b, s. v.) eksa ‘Anum, eksa’ anim ‘any- 
where, everywhere, somwhere’, but nothing further is known about 
this variant -um, and it is so far unknown whether it can be 
regarded as derived directly from -*um. Usages 4 and 5 are well attested. 
For the former Grignard 1924a gives an adequate account (pp. 31-2). 
‘to designate persons or objects already referred to, or having some- 
thing of a collective unity’, the inherited numerals for ‘two’ and 
‘three’ have suffix -im for persons (irbarim, nubbarim), and the 
numerals from ‘four’ onwards, which are borrowed from IA, have 
either -im for persons or -6 (cf. Hindi -6) for non-persons (e. g- 
cariyOrim, card), there being also IA forms for ‘two’ and ‘three’ with 
-6 for non-persons. For usage 5 Grignard 1924a gives numerous 
forms (pp. 196, 254-7). Grignard’s main description (1924a, pp. 
272-8) of this suffix seems to make it clear that its chief meaning is 
‘only, just’ (= Sanskrit eva); ‘also’ and ‘and’ are not meanings, in 
spite of Bloch 1946, p, 78. Usage 3 (Concessive) is not found. In all 
probability this Kurukh suffix -im/=m is related to # ‘um of the other 
languages so far treated (so Bloch 1946, p. 78, and Emeneau 1970, 
p. 152, n.4). The semantic structure of the other languages has 
disintegrated, but there still remains adhesi 
can hardly think of IA influence here, 


an . consoidering the prima facie 
disintegration in the NIA languages 


that might conceivably have 


on of usages 4 and 5; we’ 
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influenced Kurukh, and we should regard the Kurukh usages as 
inherited from PDr. The shift from the ‘also’ usage to that denoting 
‘only, just’ is paralleled, in reverse, by the Kota shift of *tan from 
the latter meaning to the former. The only available account of 
Kurukh’s close relative Malto (Droese 1884) has nothing relevant. 


The other NDr. language that has traces of *-um is Brahui. 
Bray (1909, p. 229, §435) recorded a form (h) um as meaning ‘also’; 
it follows the noun with which it is construed. Of his two examples 
one clearly has this meaning. Inthe other sentence which he gives; 
the word is repeated after two nominal forms, and one can convert 
his translation ‘not only ..., but into the bargain’ into usage 2 (‘and’) 
of *-um. In the published texts (Bray 1939) there are other 
examples, showing these two usages. These are precious relics, 
since ‘and’ is usually represented in Brahui by 6 (Bray takes it to 
be borrowed from the Iranian language Balochi). We have, then, 
in this language Dr. *—um in usages 1 and 2. 


This amount of evidence - the SDr. group almost entire, Telugu 
in the CDr. group, and two of the NDr. languages, in which latter 
the usages retained do not overlap, —allows reconstruction of a 
PDr. form *-um and for it a semantic structure which includes five 
usages. But thisis not all of the relevant evidence. 


In Tulu, whether or not we agree that this language isto be 
included, not in CDr., but in SDr. (with Subrahmanyam 1968b), we 
find our reconstructed semantic structure, all five usages being re- 
presented by the same form. The suffix, however, is not derived 
from *-um. It is -la (so all who have recorded it recently; Brigal 
1872, Manner 1886, Ramaswami Aiyar 1936 all wrote -14). To find 
evidence for the usages requires much piecing together of separated 
bits and pieces. Brigel 1872 gives examples of usage 1 and 2. Bhat 
1967 (p. 113) described usages 1 and 2; usage 3 is not described 
but occurs in an example (p. 94); usage 4 is not described but 
occurs in examples with manta ‘all’ (p. 105) and with a numeral 
(p. 116); usage 5 is not described but examples occur (pp. 100, 112). 
Kélver 1969 in his glossary (p. 60) gives for -la usages 1, 2, 4, 5, and 
(p. 59) under manda ‘all’ gives a number of examples with -la 
(usage 4); though he does not describe usage 3, there are examples 
in the texts (E. g. pp. 27-8). Bhat and Kedilaya (1967, p. 216) record 
two ‘particles’ -la, one for usage 1, one for usage 3. There is then, 
sufficient evidence for the five usages of the semantic structure. The 
derivation of -la (? -1a) is, highly probably, from *ella or *ellam,® 
which otherwise does not occurin Tulu (DED, DEDS DEN 718). 
One can conjecture that *ella~m) has replaced *-um, and has been 
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replaced by manta (Bhat), mands (K6lver), mande (Manner), mata 
(Brigel), manta/manta/mata (Bhat and Kedilaya), for which there is 
no etymology, unless indeed it be in DED 3847. Parallel phrases 
in Kodagu and Tulu may well be the point of departure for analo- 
gical replacement of * um by -la throughout the whole Tulu 
system; E. g. Kodagu mu:ndi kuffi ella’ ‘all three children’ (ella: > 
*ellam): Tulu e'li jevu ba’ ru] la ‘all seven female (je‘-vu) children 
(ba‘rulu, pl. of bale)’. 

On the remaining CDr. languages, we Can achieve some 
analysis, in spite of generally less than adequate description and, for 
some at least, almost total lack of texts which might provide 
unanalysed evidence. 

Kui, in Winfield’s description (1928, pp. 152-3, 161), hasa 
‘particle’ ve, which has usages 1 (though no very good example is 
given), 2 (enga ‘and’ is, however, more common),® 3 (added to 
conditional forms ending in -eka), and 5 (further examples, p. 48). 
Considering Winfield’s generally excellent analysis, it is probable 
that usage 4 (with numerals) indeed does not occur, but our lack of 
texts is particularly to be deplored, since usage 4 might be found in 
a large corpus. Burrow and Bhattacharya (1961) do not report it 
for the dialect that they observed. 


The very bad descriptions of Kuwi Schulze 1911, Fitzgerald 
1913) yield a suffix wa (Schulze), vwa (Fitzgerald), equivalent of 
course to Kui ve; they give evidence for usages 1 (Fitzgerald, p. 16) 
2 (Schulze, p. 89), 3 (Schulze, pp. 97, 196), and 5 (Schulze, p. 73: 
Fitzgerald, p. 21). Again, as in Kui, usage 4 is missing. Burrow and 
Bhattacharya (1963) in their report on this language give no example 
at all of this suffix. 


Konda (Krishnamurti 1969a) has an enclitic particle ba (pp. 
328-9, 10, 4), with usages 1, 3, and 5; instead of having usage 2 
(‘and’, which is represented by -e/-ne, p. 320), ba is used repeated 
to mean ‘or’.10 Usage 4 is not described, but in the texts there is 
a clear example: (p. 61, sent. 4.44) ri?er ba ‘both of them’ (fi? er 
‘two men’). This language, then, has ba in usages: 1,.354,5°5; and a 
development from usage 2. 


Pengo (Burrow and (Bhattacharya 1970) has an enclitic 
particle pa in usages 1, 3, and 5 (p. 61, 126; a further statement and 
example of usage 3 are on p. 101 $207). This form also occurs 
very frequently (p. 92, §188, and passim in the texts) in narrative 
texts following the “‘conjunctive participle’’ (i. e. gerund - presumably 
this is usage 1. For ‘and’ Pengo uses ar, dre. i.e. the ~anive form 
No examples have been found of usage 4 in the texts. ek 
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Parji (Burrow and Bhattacharya 1953) has a particle bele, 
which is not described, but which is found in the texts in usages 1, 3, 
and 5 (usage 1, E. g. p. 72, sent. 19, p. 110, sent. 44, p. 113, sent. 
69; usage 3, following conditional forms in -od, E.g., p. 131, sent, 
70 and 75, p. 142, sent. 158; usage 5, E.g. p. 108, sent. 31 and 34, 
p. 111, sent. 56 and 57, p. 112, sent. 62, p. 117, sent. 104), For 
‘and’ Parji uses aru, the Halbi form. No examples have been found 
of usage 4. 


The forms in these five languages are related. The closely 
related languages Kui and Kuwi have ve and wa, giving evidence 
for the two vowels e anda. Konda ba and Pengo pa both have 
the vowel a; the three consonants v, b, p are somehow related, but 
no historical solution is yet in sight. Parji’s bele seems to be related, 
with be- equivalent to Kui ve. If it were not that Burrow and 
Bhattacharya (1953, p 68) mention a Halbi suffix-li which occurs 
in the Parji adverbs anatle ‘from there’ and inatle ‘from here’. it 
would be tempting to find in bele some remnant of the ‘all’ word 
*ella- (this word occurs as such in Kolami and Kuwi; see DED, 
DEDS 718), i. e. be/a-ele, but this, in spite of Tulu -la, is perhaps 
pushing too hard to find a cognate. 


These forms are hardly related phonologically to *-um. On 
the other hand, the semantic structures of these languages must be 
derived from the five-usage structure seen in SDr., Telugu, and Tulu. 
All these languages have usages 1, 3,5; Kuwihas usage 2; Konda 
has usage 4, and a development of usage 2 to the meaning ‘or’. 
Usage 4 might well be found in more of the languages, if we hada 
larger corpus of texts. The semantic structure, asa PDr. semantic 
structure, then, is considerably bolstered by this CDr. evidence. 
That the form* -umis the more original representation of the 
structure, we have found to be guaranteed by its presence in SDr., 
Teluyu. and NDr. An innovating replacement of this form by that 
with a labial initial in many of the CDr. languages is paralleled by 
the innovation in Tulu, as well as by that in Kolami, which is still 


to be described. 


In the somewhat scanty Kolami material, forms ~na/ -nay 
represent usages 1,2,4,5 of semantic structure (Emeneau 1955, 
p. 63. $4.79). Usage 4 wasfound by direct elicitation from the 
informants, since the SDr. usage was well-known to me at the time. 
Usage 2 is represented most commonly in the Wardha dialect by ini 
(DED 35\c; cf. Ta. ionum, ini, Ka. inu, etc.), in the Adilabad dialect 
(SR) by ani, which is IA, or by inka, with which compare Kui enga and 
Telugu ika, iga (see note 9 above) and Emeneau 1955, p. 199, (s. v.ini). 
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Usage 3 is not found, but there are examples of the suffix 
used with non-finite verb forms, such as the gerunds (Emeneau 1955, 
p. 96, §5.111-2), which parallel the Pengo use of pe referred to 
above. 


Gondi material is almost entirely lacking in the various 
accounts. The dialect treated by Trench (Trench 1919, 1921) has 
for usage 2 commonly ani (Indo-Aryan), but there are also examples 
of repeated bhi. which is of course borrowed from Hindi (of the 
non-standard type); bhiis also found in usage 1 (both usages, e.g., 
in text, Trench 1921, p.27). For Koya there are two accounts 
(Subrahmanyam 1968a, Tyler 1969). Tyler records (p. 84) usage 3 
with a suffix -kanna, which I judge should rather be analysed as 
conditional -ku plus -anna, the more especially since in usage 5 
(p. 60) -anna is added to interrogative pronominal forms. Subrah- 
manyam records (p. 143) -a’na in usage 3, added to the conditional 
forms. These forms may possibly be cognate with the Kolami forms 
-na/-nay; however, two languages have in common for these forms 
only usage 5, since Kolami seems not to have -na for usage 3. 


Whether or not the Kolami and the Koya forms go together, 
it must be noted that Schulze (1911, p. 88) records as one of the 
Kuwi representatives of ‘and’ a repeated suffix -na, which must 
surely be cognate with the Kolami -na/-nay. Kolami surely has an 
innovation, perhaps based on something like the Kuwi usage 2 and 
expanded throughout most of the semantic structure. Whether this 
innovation replaced *-um or something like bele of Parji, a fellow- 
member of the subgroup which includes Kolami, or even the CDr 
v/p/b e/a, cannot be guessed. 


We end the Dravidian account witha well-evidenced PDr. 
*-um repesenting ail five usages of the semantic structure, attested 
in the most ancient Tamil texts and covering both South, Central, 
and North Dravidian. All else in the material so far availabie 
includes three innovations in form, one in Tulu, a second in Konda, 
Pengo, Kui-Kuwi, and probably Parji and a third in Kolami. In spite 


of these innovations, however, the semantic structure of *-um has 


been kept, either completely as in Tulu, or to an extent which includes 
three or more of the five usages. Usage 4 is most often missing in the 


record, but there is suspicion that it has often merely failed to be 
recorded. 


We have then the same five-usage semantic structure represented 
in Sanskrit by api, and in Dravidian or, better said, Proto-Dravidian 
by *-um. What is the historical relation between them? We can 
hardly think of accident, when the Semantic structure is of so specific 
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and complex a nature, hardly a universal except in the limited sense 
of an areal ‘universal’, There must be some historical relation. None 
of the Sanskrit structure is inherited straightforwardly from Indo- 
European, or even from Indo-Iranian. Usage 1 is Vedic, appearing 
already in the earliest texts; usage 2 is certainly derived from it, and 
examples from the early texts may be interpreted as showing this 
latter usage. Usage 3 (concessive), when found with single words or 
phrases (e. g. balo ‘pi), is said to be already found in the early texts; 
yady api ‘even if’ is said to be Vedic (so Renou 1961, p. 526), but in 
fact does not appear before the brahmana texts (Satapathabrahmana) 
in Bohtlingk and Roth’s lexicon (1852-75). Usages 4 and 5 are not 
found before the classical texts. This is clearly innovatory develop- 
ment within the history of Indo-Aryan. PDr. *-um with all five 
usages is found in the earliest Tamil records, roughly as early as the 
beginning of the classical Sanskrit record. Since to postulate a feature 
as PDr. means extrapolating it a considerable number of centuries 
earlier than these records, we may assume that PDr. had this feature 
at least as early as the emergence of usages 3, 4, and 5 into the Indo- 
Aryan record. Given bilingualism between the two familes in North 
India at this period, there is no probable solution of our problem 
other than diffusion from Dravidian into Indo-Aryan. The Sanskrit 
usages are essentially a calque of Dravidian *-um by Sanskrit api. 
Would it be too much to find that the uneven distintegration 
of the structure in NIA is due to a differential extinction of Dravidian 
in different sub-areas, earlier in the Gangetic valley, where E. g. the 
disintegration of normative Hindi has gone deceptively far, later in 
the area of Marathi, where there is hardly any disintegration of 
the structure? 


(2) Social structure and linguistic structure in India. 


The caste structure of Hindu society has many ethonological 
implications, and much has been written about these(E.g Karve 1961, 
Mandelbaum 1970). The linguistic implications have been less thorough- 
ly explored, though work has been done in recent years on caste dialects 
and their interrelationships (E. g. Gumperz 1971). But there is now at 
hand a body of material illustrative of the relationship betweea caste 
structure and vocabulary structure in various of the languagesof the 
Indian sub-contineat. This is presented, with discussion of socio- 
linguistic implications and of the materials bearing on the hypothesis 
of India as a linguistic area. 

An obvious section of the data consists of the vocabulary 


items which designate caste membership.11 In Sanskrit, for example, 
there are long lists of words for the male and the female members 
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of the very many castes. The earliest such lists in the record are 
well represented by those in the Mahabharata, book 13 (Anusasana- 
parvan), chapters 48-9, and the Manavadharmasastra, chapter 10, 
which are clearly related to one another in the details of the lists 
and undoubtedly represent a traditional statement that was formulted 
and circulated in the last pre-Christian centuries (probably post- 
Panini, since Panini contains only the oldest layer of terms). It is 
the classical brahmanic verbalization ofa sociological theory, in 
which the multiplicity of endogamou castes (jati), which are in part 
at least occupational groups, is derived from the earlier, Vedic state- 
ment of four varnas (also occupational), by positing reprehensibly 
mixed marriages between members of different varnas and subsequent 
between them and the resulting half-breeds in a potentially infinite 
series of combinations. The jatis identified in these texts seem to be 
only minimally identifiable with the jatis of the system as it is now in 
fact operative and known, but we are not concerned here with this 
problem. Such paired terms for male and female member of castes 
continue to be recorded in all the Indo-Aryan languages down to the 
present time (see the many items in CD/AL). 


Similar lists can be provided from the Dravidian linguistic 
family. An easily presentable example is that found in the miniature 
caste system of the Nilgiri hills (see below), with its four castes, each 
with its own language (or languages, since the Kurumbas probably 
have several languages). For three of the languages the lists are 
complete in my fieldnotes (the numbers following the items are 
references to entries in DED): 


Toda language Kota language Badaga language 
male female male female male female 
Toda 0°] toz,mox ton tody todava_ toduvati 
(342) (2885) (2885) (2885) (2885) (2885) 
Badaga ma.f madty mayv mayt baduga badugati 
(3956) (DEDS 3798) (3956) (3956) (cf. 4267) (cf. 4267) 
Kota kwi.f kwi.ty ko.v ko.ty ko.ta ko.ti 
(1468) (1468) (1468) (1468) (1468) (1468) 
Kurumba kurb_ kur (b) é kavar_ kavarc kuruma kurumati 
(1530 (1530) (cf. 1530) (cf. 1530) 


In all such lists the semantic structure is one of paired desig- 
nations of males and females. This is true whether the language of 
which the list is a part is Indo-Aryan, Dravidian, or Munda.l2 In 
occupational castes it is usually the male who practises the occupation 
When, as usually, the casts designation in Such cases refers to the 
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occupation, the designation of the male means ‘he who practises 
such-and-such an occupation’; that of the female means, not ‘she who 
practises such an such occupation’, but ‘female related (most import- 
antly as wife, but also as mother, daughter. or other female relation) to 
him who practises such-and-such an occupation’; E.g. Sanskrit kum- 
bhakara- ‘he who makes (kara-) pots (kumbhakari-) ‘the wife of the 
potter’ (feminine suffix -i ), ksatriya— ‘the ruler-warrior’, ksatriya- 
‘the wife of the ksatriya~’ (feminine suffix-a-).13 In all the other, 
non-transparent examples the semantic structure raust be the same. 
This, of course, tells us something about the relative status of men 
and women; the woman has status only in relation to a man.14 


Equally, and perhaps more centrally, sucha rigid insistance 
on paired caste designations reminds us that we are dealing with a 
social structure in which the designated groups are rigidly endogamous 
(whatever may be the further division of a group into secondary 
endogamous groups-stated in reverse, this is the grouping of jatis 
into a jati-cluster, which latter is labelled by the pair of caste 
designations; Karve 1961, pp. 10ff.), and membership in the group is 
derived strictly by birth. In such a society, in which the individual, 
male or female, has necessary and unchanging membership in one 
group, the designation of group membership is all-important. This 
isan all-India (i.e. Hindu) socio-linguistic trait. The semantic 
structure of this portion of the lexicon goes hand-in-hand with the 
social structure. Functionally, at least, if not necessarily histori- 
cally, the social structure has priority over the linguistic structure. 


But there are also historical implications. The invading Aryans 
did not bring this social structure with them. They either met it in 
India and adopted it in their process of Indianization, or they and 
those they met in India developed it together to produce the India 
we now know. Historically, then, we must assume that linguistically 
Indo-Aryan, beginning with Sanskrit, developed its caste-designation 
system as a part of the historical Indianization process. It is hard to 
see how this could have come about without assuming bilingualism 
as the operating factor in the convergence of social structure, and 
consequently of linguistic structure toc. Moreover, if those scholars 
are correct who argue that the caste structure was already there 
among the indigenes when the Indo-Aryan invasion took place 
(implied by Karve 1961, p. 15), it follows that this linguistic trait 
was indigenous in origin, probably most specifically Dravidian, and 


diffused into Indo-Aryan. 


There is some evidence in South Dravidian that the vocabulary 
items denoting the biological distinction between child and adult 
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are also infected by the caste structure. In Kodagu the terms 
mo‘vé9 ‘son’, mo'va ‘daughter’, makka ‘children’ (DED 3768) may 
possibly refer to children irrespective of caste. However, a small 
amount of contextual evidence makes it clear that for reference to 
children of local Hon-Coorg, i.e. lower, castes there are special 
terms, of which I recorded kutti ‘child of any caste except Coorgs’ 
(DED 1390) and pajja ‘Holeya girl’ (DED 3248). This then would 
tend to restrict mo‘vé", mo‘va, makka to children of the Coorg ie. 
upper, caste, when the discourse is on matters within the traditional, 
local system. Similarly, the Tulu lexicon records kundu (DEDS 
1371), paiyyé/paiy(y)eru (DED 3248), and jéru (DED 2311) as 
referring to a Pariah child; the implication would be that there is a 
certain amount of caste differentiation in the words for children. 
For Tamil and Malayalam there is little evidence. cirral is in the 
record for both languages as meaning ‘boy or girl cooly’ (DED 1326), 
and for Malayalam there is recorded in the same entry a pair 
ceruman/cerumi ‘slave boy/girl’ (but also ‘Pulayan/Pulachi’, i.e. male 
and female members of this caste, which ought to include reference 
to adults as well as children), Apart from this, I have no evidence 
and it may be that this is a locally restricted development (? Tamil 
Nadu and Malabar coast), that never spread to the rest of India. 
Even so, it is possible to suggest that South India is historically the 
area of greatest development of caste developed vocabulary and 
consequently is the locus from which there was diffusion to other 
parts of India-a conclusion which reinforces that which we reached 
above on other grounds. 


In addition to the caste designations and the evidence they 
provide for diffusion from Dravidain into Indo-Aryan, there are 
other facets of vocabulary which point in the same direction. 


A feature of the caste structure is the encouragement of 
difference, of non-uniformity, between the groups. There is, 
undoubtedly, much diffusion of traits between groups, in all 
directions. But the overwhelming impression on the Outsider is 
that this pressure for uniformity is balanced by an at least equal 
pressure to preserve or develop distinctive differences. All up and 
down the hierarchical ordering of the groups, external differentiation 
is a sign of a group’s position in the ranking. There js usuaily too 
a drive to establish a metaphorical distance between the groups, in 
an effort to maintain a ceremonial purity which might be destroved 
by contact with those outside the group. Such metaphorical distance 
is manifested as a physical distance in the Structure of multi-caste 
villages. It was well put by Karve (1961, p. 33). sae habitation 
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area in each village was divided into areas containing houses of one 
caste ..Besides the caste clusters the whole area was generally divided 
into two main habitation areas, one belonging to the higher castes, 
the touchables, and the other belonging to the lowest castes, the 
untouchables. Different castes among touchables and untouchables 
within these larger areas lived a little separated from the others. 
The village habitation area was thus roughly divided into as many 
units as there were castes in the village.’? This statement was 
based on conditions in Maharashtra and Andhra, but it is generaliz- 
able for all India.15 There is some evidence that this physical 
separation of caste living units is paralleled by a proliferation of 
differentiating names for such units. The evidence is scanty and, 
as it were, accidentally recorded for most areas. The best evidence 
that is available is that which I recorded in the Nilgiris, and even 
that has numerous gaps. 


Before the social disruption of the Nilgiris began a century 
and a half ago upon the arrival of the British and their opening of 
the area to extensive Indian migration from the plains, this area was 
the home of a miniature local caste system of four groups-Todas, 
Badagas, Kotas, Kurumbas. It had many of the typical caste-system 
features-a ranking of the four groups,!6 economic relations of the 
jajmani type, endogamy, and maintenance of difference and distance. 
One of the differentiating features is extreme even for India, viz. 
that each of the four castes spoke a different language (there are — 
probably more than four, since the Kurumbas, still little investigated | 
include several different groups with probably several languages). 
Distance is also maintained in a way that is possibly unique; for this 
there are economic reasons, though the factor of ceremonial purity 
also undoubtedly operates. There are no multi-caste villages; each 
caste has its own habitation pattern. The Kotas have seven villages 
scattered over the area, each in the center of its own agricultural 
tract and of its circle of jajmani relations by which its supplies 
pottery, iron tools, funeral music, etc. to Todas and Badagas in the 
neighboring area. The Badagas, relative newcomers to the area and 
suffering a population explosion apparently from the time of their 
arrival, have many villages, each in its agricultural tract. The Todas, 
originally using the whole area, except for parts occupied by the 
Kotas, as pasture for their buffalo herds, lived in clusters of three or 
four houses on the pastures, each cluster belonging to one lineage in 
a patrilineal clan; they also practised a seasonal nomadism 
(transhumance) for the best utilization of the pastures. As of the 
last century, the Todas had suffered diminution of the pasture areas 
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by alienation to the British need for urban areas and, earlier and 
more importantly, to the pressure of the Badaga explosion of popul- 
ation and agriculture; the Todas give the impression now that they 
occupy the interstices and are in process of being squeezed out. The 
Kurumbas have always lived in very small settlements or even 
isolated houses in the marginal jungles of the lower-lying Nilgiri 
periphery, and occupy themselves in large part with jungle pro- 
ducts. This very functional manifestation of the distance requirement 
is seen linguistically in the separate designations for the physically 
differentiated settlements of the four castes. I have complete sets 
of terms in the Toda and the Kota languages (the numbers refer to 
entries in DED; for examples of Toda usage, see not 16): 


Toda language Kota language 
Toda mod (3913) mand (3913) 
Badaga oty (3199), urr (643) paty (3199) 
Kota kwi' ko‘l (DEDS $200) ko. ka‘l (DEDS $200) 
Kurumba mut (4204) motm (4204) 


My Badaga material is very slight. It is probable that it has a 
similar set of terms, but I recorded only: mandu ‘Toda village’ 
(cf. 3913), and hatti (3199) or uru (cf. 643) ‘Badaga village’. 
Hockings records the Badaga word for a Kurumba village as kombe 
(cf. 1441). 


The houses making up the settlements are different in type 
and terms for the houses would be expected to be differentiated by 
caste. I have fragmentary material. In the Toda language a Toda 
house is a's (DBIA 165), a Badaga house poly (3309). In the Kota 
Janguage there was recorded no differentiation, pay being the term 
for ‘house’ regardless of caste (DEDS S428). In BadagaI have only 
mane (cf. 3911), the term for a Badaga house, and no indication of 
usage for the houses of the other castes. 


Evidence on village and house terms is fragmentary for the 
rest of the South Dravidian area, but what I am able to present 
makes it evident that the Nilgiri material is not untypical, and that 
infact South India in general has this linguistic trait. It should 
be noted as preface that the Tamil material represents in part separate 
villages, in part the situation described by Mrs. Karve, in which 
sections of a multi-caste village are identified by separate terms. 
The Kodagu and Tulu material refers only to houses. The Malayalam 
material is mixed. In Kerala (Malayalam) and Coorg (Kodagu), 
which are contiguous areas, the predominant habitation pattern is one in 
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which houses are not clustered in compact villages, but are sited 
each in its own agricultural area, with the houses of specialized 
occupational personnel near that of the landowner. The Tulu area 
is probably organized in much the same way. None ofthe linguistic 
data allows systematic organization of the kind that I was able to 
achieve for the Nilgiris data. The available material is as follows 
(the numbers following the items refer to entries in DED): 


Tamil 


céri ‘street, quarters [i. e. village section] of the Pariahs’ (1669) 
patam ‘street of herdsmen’ (3347) 

kuppam ‘small village of fishermen and other low-caste 

peple’ (1441) 

kumpai ‘settlement, especially of Paficamas’ (1441) 

palli ‘herdsman’s village’ (3309) 

pakkam ‘village of Kuravans’ (Burrow 1948, p. 382, 

quoting Kuruntokai 339; the Kuravans [1530] are a caste of 
fowlers, snake-catchers, basket-makers, and fortune-tellers; 
ci3532) 


Malayalam 


illam ‘house of Nambudiri Brahman’ (420) 
vitu ‘house, especially of Nayar or Janmi’ (4419) 
palli ‘small settlement of jungle tribes’ (3309) 
teru ‘street, a weaver village’ (2818) 

Kodagu 
mane ‘Coorg house’ (3911) 
palli ‘house of a Poleyé or of a Me. dé’ (3309) 
bu-di ‘hut of toddy-tapper’ (4419) 
pa‘di ‘hut of Kurumba’ (3347) 


Tulu 

bidu, bidu ‘residence of a headman’ (4419) 

ké'u ‘abode of a Pariah’ (1656) 

koppa ‘hut of the lowest grade of Pariahs called Marimanserw’ 

(1441) 

kotta ‘hut of Koragars’ (1713) 

Indo-Aryan material of this sort is very scantily attested, and 
yet there is some in the record. Eg. Sanskrit has pakkana- ‘the 
hut of a Candala or any outcaste [i.e. member of low caste], a 
village inhabited by savages or barbarians’ (the meaning ‘village of 
Candalas’ is clear in Mahabharata 12.139.12, 33, 41, 43), and palli- 
‘a small village, especially a settlement of wild tribes’ (lexical; 
pallighosa- ‘herdsmen’s  villege’ ibid. 12.312.20). Nepali khopro 
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chouse of a slave’ (CIDAL 3938 *khoppa-, extension with -d-) has 
an implication such as we are looking for. More specifically to the 
point is Marathi pada ‘a hamlet or a cluster of houses of agricultura- 
lists, a ward or quarter of a town, (Molesworth 1857; quoted in 


DED 334?). 


This pattern, then, of village/house designations linked with 
the caste structure is pan-Indic. The linguistic pattern seems, on 
our evidence to be more strongly developed in the Dravidian area 
than in the Indo-Aryan, and on the basis of distribution only we 
should be inclined, as we were before, to find the origin of the trait 
in the Dravidian area. This argumeat, of course, has so far ignored 
completely the presumptive Dravidian origin of the Indo-Aryan 
terms quoted;17 this should clinch the argument for the direction of 
diffusion of the trait. 


Besides the mere semantic structure of the vocabulary items 
that are infected by the social structure, some interesting features 
are found in their stylistic use. 


Terms which designate the caste membership of persons have 
an obvious use in talking about subjects in which caste membership 
is relevant. In addition, however, there are uses found in narrative 
usually artless prose, but also the ‘artless’ verse of Sanskrit epic, 
which are not at all obvious or predictable. Initial identification 
of narrative characters by caste designation is fairely obvious (e.g. 
‘In such-and-such a village there was a brahman named So-and-so’), 
though even this often contains a redundancy. I.e., to add to the 
identification of the character as ‘human male’ an identification as a 
member of a particular caste group, i.e. to say ‘a brahman’ rather 
than merely ‘ a man’, often adds a detail that is irrelevant to the 
action of the narrative. One can say that such irrelevant identification 
is a Stylistic trait which intrudes because of the strong emphasis in 
the culture upon this feature of secial organization. 


Narratives may continue with identification of the various 
actors in their successive actions, by means of contrasting deictic 
forms (E. g. ‘this man’; ‘that man’). But proper names may provide 
the mechanism for keeping the actors apart in proper contrast, or 
other contrastive labels may be used such as ‘the old man’: ‘the 
young man’, ‘the father’: ‘the son’, or the like. The use of such 
pairs is familiar enough in all ‘artless’ narrative, including that of 
India. However, in India, if the contrast is one of caste membership 
(¢. g. ‘the brahman’ : ‘the potter’), the caste designations are usable 
and may displace deictic forms in the carrying on of the narrative. 
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In the case of a husband and wife, there are contrastive terms 
that denote nothing more than this relationship (‘husband’ : wife’), 
and this pair is often enough found in narratives displacing the 
contrastive deixis of ‘he’: ‘she’. But sometimes there is found the 
highly redundant addition of caste designation, and, e, g., ‘the 
brahman’‘ and ‘the wife of the brahman’ are the terms used through- 
out the narrative. The latter may even be qualified by a gentive 
denoting the man whose wife she is (‘his brahmani’, ‘your brahmani ) 
or qualified by a genitive denoting a person related to the male ina 
reciprocal status relation (E.g. upadhayini te ‘the wife of your 
teacher’). Such caste designations displace both the ‘husband’—‘wife’ 
pair and the less redundant anaphoric pronoun pair (or deictic pairs 
‘he’: ‘she’). It may be noted, of course, that for the Speakers of 
such a language as Toda, in which the pronouns have no gender 
distinction, the least redundant possibility for keeping male and 
female characters apart in continuous narrative (‘he’: ‘she’) is not 
available. 


Use of the caste designation nouns as vocatives is to be 
expected. Such usage would normally be between members of 
different castes. E.g.in the Mahabharataa brahman is addressed 
by a member of another caste by a vocative form brahman, brahmayga 
(rather rare), vipra, or dvija (or by a superlativized term such as 
dvijottama, dvijavarottama, dvijasrestha, dvijasattrma, or 
vipragrya, all roughly ‘best of brahmans’). Or, a charioteer 
(sita-)18 in the early portions of book one of the Mahabharata 
(whether the individual is Ugrasravas or Samjaya) is addressed 
by a brahman or a ksatriya by means of the vocative sata or 
by saute or sitaja ‘Oson ofa charioteer!’ (saute 1.1.7a, 1.14.la, 
1.53.27d, etc; sitaja 1.48.3d, 1.52-1d, 1-53.34d, etc.). Similar use 
of the vocatives of terms for females has not been found in Sanskrit, 
even though it is to be expected (cf. the Toda Parji examples below). 
It may be that, E. g. in the epic, more honorific terms (E. g. bhavati 
‘O revered lad!’) have inhibited the use of the less honorific, bald 
caste designations. It is possible, however, that rajaputri, vocative 
of rajaputri—- ‘king’s daughter’ (E. g. 3.32.2b in address to Draupadi) 
comes closes to a more ksatriya caste designation, even though it 
insists additionally on regal status. 


A rather unexpected use of such vocatives, or at least of 
brahmana and brahmani, by husband and wife to one another, goes 
alittle further in redundancy and insistence on caste status than 
anything that occurs in reference contexts. J.e. tasya brahmani 
‘bis wife (he being a brahman)’, te ksatriya ‘your wife (you being a 
ksatriya)’? and upadhyayini te ‘your teachers’s wife’ were the limits 
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that were found there. There was no occurrence of ksatriya me 
‘my wife (I being a ksgatriya)’, though sa ksatriya ‘that ksatriya’ 
occurs in that meaning, and (sa) ksatriya me is to be expected. This 
link is lacking in the epic (so far as discovered) between the attested 
reference usages and such vocative usages as brahmana or brahman 
‘O (my) husband (being a brahman)!’ and brahmani ‘O wife (of 
mine, I being a brahman)!’ in aconversation in the Mahabharata 
(see 1.146.22, 26, given below). However, the link is found in later 
prose; see quotations from the VetalapaficavimSati below. 


The redundant identification of characters by their caste 
membership, and redundant reference and address to the interacting 
actors by means of caste designations, can, of course, happen only 
in a society organized by caste. In other types of societies artless 
narrative my have a similarly redundant style (E. g. repetition in a 
narrative of ‘the traveling salesman’: ‘the farmer’s daughter’). My 
impression, however, is that bare anaphora and deixis is at least as 
common, and certainly so in cases where gender differentiation is 
possible (‘he said’: ‘she said’). But, granted that redundancy is 
frequent in artless narrative, it still remains usefull to repeat that in 
India the redundancy is often an identification of caste membeship 
even when such identification is irrelevant to the narrative. Again, 
we note the effect of the pressure of social structure upon the 
linguistic sphere. 


To label these stylistic traits Indian ‘universals’ would go 
far beyond the available evidence. There are very few bodies of 
textual evidence that are extensive enough or well enough recorded 
to be reliable in this matter. There is sufficient evidence in the 
Sanskrit and Pali corpora, and some scanty Prakrit evidence. For Old 
Marathi there is a little available evidence, and Nepali and Bhojpuri 
texts show some for Modern Indo-Aryan. On the Dravidian side, 
there are scraps of evidence in the available texts in Parji, Pengo, 
and a little more in Gondi. My extensive corpus in Kota shows 
hardly any, but I think useful contexts accidentally failed to be 
recorded; this is certainly true of my Kodagu texts. In Toda I have 
a few texts that show the reference usage, and the normal prose 
vocatives between husband and wife involve caste reference according 
to statements by informants, though the prose texts in my records 
fail to show this, probably because of failure of occurrence of the 
proper contexts. 


Evidence is as follows: In Sanskrit literature there seems to be 
nothing to the point in the Vedic texts, whether samhita, brahmanas 


—ee 


Se eee ee Te 
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or upanisad. The grammatical texts, Panini and Katyayana, have rules, 
with examples, for the derivation of designations of females from those 
of male caste members (E g. P. 4.1.49 and Katyayana’s ksatriya-and 
ksatriyani-derived from kgsatriya ); but there are no examples of the 
stylistic uses just sketched. In the epic Mahabharata, however, several 
passages with examples have been found (there are doubtless others 
that I have not noticed). The most conspicuous passage (1.3.85 to 
the end of the chapter) is in prose, the story of the brahman youth 
Uttanka, his teacher (upadhyaya-) Veda and the teacher’s wife 
(upadhyayini-), and king Pausya (referred to both as rajan- ‘king’ 
and as ksatriya-) and his queen (kgatriya-); itis to be noted that 
besides the pair of caste designations, there is also in this passage the 
pair upadhyaya- ‘teacher’: upadhyayini- ‘wife of teacher’, based on 
one of the sub-relations (teacher: pupil) within the brahman caste. 
Relevant passages are the following. 


1.3.89 sa vasams tatro ‘padhyayastribhih sahitabhir ahaiyo 
‘ktah: upadhyayini te rtumati, upadhyayas ca prositah. ‘As he 
(Uttanka) was dwelling there, the women of his teacher’s household 
together sent for him and said: ‘‘Your teacher’s wife is menstruating 
and (your) teacher [or, her husband] is absent from home.” 


1.3.98 tad gaccha. enam pravisyo ‘padhyayinim precha kim 
upaharami ‘ti. [The teacher says to Uttanka, who, after finishing 
his course of study, wishes to make the usual gift to the teacher 
(gurvartha)]: “So go! Enter and ask her, (your) teacher’s wife, 
[i. e. my wife], saying: ‘What shall I offer as a gift?’.”” 

1.3.99 sa evam ukta upadhyayeno ‘padhyayinim aprcchat: 
bhavaty upadhyayena ‘smy anujfiato grham. ‘When he had been 
spoken to thus by his teacher, he asked his teacher’s wife: “Revered 
lady! I have been given permission by my teacher [your husband] 
to go home.’’ 


1.3.100 sai ‘vam ukto ‘padhyayiny uttankam pratyuvaca: 
gaccha pausyam rajanam. Bhiksasva tasya ksatriyaya pinaddhe 
kundale. ‘When the teacher's wife had been spoken to thus, she 
answered Uttanka: ‘Go to king Pausya! Beg of him the earrings 
worn by his wife [Lit. his ksatriya]!”’. 

1.3.101 saevam ukta upadhyayinya pratisthato ‘ttankah. ‘When 
he had been spoken to thus by his teacher’s wife, Uttanka started 
off.’ 

1.3.108 tam uvaco ‘ttaikah: gurvarthe kundalabhyam arthy 
agato ‘smi ‘ti ye te ksatriyaya pinaddhe kundale te bhavan datum 
arthati ‘ti. ‘Uttanka said to him [the king]: ‘Since I have come 
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desiring earrings as a gift for my teacher, be good enough to give the 
earrings that are worn by your wife flit. your ksatriya]!”’ 


1.3.109 tam pausyah pratyuvaca: pravisya ‘ntahpuram kSatriya 
yacyatam iti, ‘Pausya answered him: “Enter the women’s apartment 
and ask my wife [lit. the ksatriya’s wife] !”’ 

1.3.110 sa tenai ‘vam uktah pravisya ‘natahpuram ksatriyam 
na ‘pagyat. ‘When he was spoken to thus by him, he entered the 
women’s apartment, but did not see his wife [lit the kgatriya’s 
wife].’ 

13,111 na hi te ksatriya ‘ntahpure samnihita nai ‘nam 
pasyami ‘ti. [Uttanka said to the king]: “ . For your wife [lit. your 
ksatriya] is not there in the women’s apartment, since I do not see 
her.’’19 

1.3.112 na hisa ksatriya ucchistena ‘Sucina va Sakya drastum. 
[The king said]: “‘... For my wife [lit. that ksatriya] cannot be seen by 
one who has not washed after eating or by one who is impure.”’ 


L315 ease antahpuram pravitya (am ksgatriyam apasyat, 
‘[Uttanka washed], and entered the women’s apartment and saw the 
king’s wife [lit. that ksatriya].’ 

1.3.120 §a evam uktas tam ksatriyam pratyuvaca. ‘When he 
{[Uttanka] had been spoken to thus, he answered the king's wife 
[lit. that ksatriya].’ 


1,3.121 sa evam uktva tam ksatriyam amantrya pausyasakasam 
agacchat. ‘He [Uttanka] spoke thus and took leave of the king’s 
wife [lit. that ksatriya] and came to Pausya’ 


1.3.160 adya that punyakam upadhyayinyah. [Uttanka thought]: 
“Today is that ceremony of my teacher’s wife.”’ 


13316225 pratyajagamo ‘padhyayakulam. upadhyayini ca ..... 
[Uttanka] returned to his teacher’s residence. Aud his teacher’s 
6) eee : 


1.3.163 atho ‘ttankah pravisya upadhyayinim abhyavadayat. 
te ca ‘syai kundale prayacchat. ‘Then Uttanka entered and saluted 
reverently his teacher’s wife. And he presented those earrings to her.’ 


These passages present the caste terms in numerous varieties 
of reference contexts; there are no examples of vocatives. 


Mahabharata 1.145 ff. has a conversation in verse between a 
brahman and his wife. In the surrounding narrative they are 
referred to as viprasya sabharyasya (1.145.17), brahmanam... 
bharyaya...sabitam (1.145.19), vipro bharyaya saha (1.149.119), all 


ijdl Vol. UI No. 1 123 Emeneau 


‘the brahman with his wife (bharya)’. But in the conversation itself 
the wife addresses her husband with the vocative brahmana ‘O 
husband (who is a brahman)’ in 1.46.22c,20 and he addresses her in 
1.145.26b with the vocative brahmaai ‘O wife of (me) the brahman,21 


In the Paficatantra, book 5, the brahman and the mongoose 
story, Edgerton’s reconstructed text (Edgerton 1924), which is 
prose and fairly artless (certainly not kavya), contains, instead of 
anaphoric pronouns in the narrative references to the brahman and 
his wife, the nouns brahmana- and brahmani- (sentences 6, 8, 28, 
29, 30, 41, 42), and similarly in an inserted story about another 
brahman and his wife (sentences 24 and 25). Edgerton indicates 
certainty for the use of the nouns in the original text in the majority 
of these passages. In sentence 41 the brahmani addresses her 
husband with the vocative form brahmaya, but the indication is 
that this was probably not in the text of the original. In the 
Hitopadesa text given in Lanman’s reader (1884; p. 44, line 1) there 
is found tasya brahmani ‘his wife’ with the genitive tasya ‘his’; this 
version has not the claim to antiquity that the reconstructed 
Paficatantra text may well have. 


The prose versions of the VetalapaficavamSati, those of 
Sivadasa (Uhle 1881) and Jambhaladatta (Emeneau 1934), have 
reference usages of caste designations in abundance, independently of 
one another, i.e. presumably not deriving them from ‘an original 
version. Moreover, brahmana- and brahmani- for ‘husband’ and 
‘wife’ are found, both in 3d person reference and used reciprocally. 
The missing link, ‘my brahmani’, which has not yet been found in 
the epic, occurs in Sivadasa’s versions (p. 36, |. 25) gangapare vasami, 
tatra madiya brahmani sthita. ‘I live on the bank of the Ganges; 
there my wife (brahmani) is’; (1. 20) tada madiyabrahmanya 
bhanitam. ‘then my wife said’; etc. Otherwise, Jambhaladatta’s 
version has, E. g.: (p. 28,1. 1) tato rubraSarmana brahmani nigadita 
‘then Rudrasarman said to his wife (brahmani)’; (p. 106, 1.2 from. 
bottom) tatra yajiasomanama brahmaaas tisthati, taaya brahmani 
somadatta. ‘There there lived a brahman named Yajfasoma. He 
had a wife (brahmani) Somadatta.’ One passage in Sivadasa ex- 
pands brahmana-— and brahmani- with grhastha- ‘householder’ (i. e. 
living in the married householder’s stage of life) and grhastha- ‘wife 
ofa householder’, status terms within the brahman life cycle: 
(p. 13, 11. 4 ff.) grhasthena brahmanena bhanitam: grhaathaya 
brahmanya krodhavasat as balako jvalitagnau praksiptah. ‘The 
brahman householder said: “...’’» The wife of the brahman house- 
holder in anger threw the child into the burning fire.’ The vocative 
use by husband and wife to one another is not common in these 
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texts, but Jambhaladatta has one example of brahmani ‘wife;’ 
(p. 28, 1. 1), and Sivadasa several of brahmana ‘husband!’ (p. 36, 11. 
27, 33). Neither of the verse versions of the VetalapaficavimSatl, 
those of Somadeva and Ksemendra, has any of these usages in the 
parallel passages, and possibly there are no examples in these 
versions, 


In Pali the Radha-jataka (Andersen 1917, p.9) has three 
clear occurrences of brahmani-meaning ‘brahman’s wife’ (1.10) 
brahmanassa pana brahmani arakkhita dussila. ‘But the brahman’s 
wife (lit. the brahman’s brahmagi) was difficult to watch and of 
bad morals’; (1.14) so...brahmanim sukapotake paticchapetva 
agamasi. ‘He (the brahman) . entrusted his wife (Jit. the brahmani) 
to the young parrots and went off’; (1.17) brahmano imeem 
brahmanim amhakam niyyadetva gato. ‘The brahman entrusted 
this wife of his (lit. this brahmani) to us and went off.’ How much 
more is to be found that is parallel to this I do not know, nor is 
it known whether there are passages in which the caste designations 
are used in the vocative by husband and wife to one another. 
Moreover, considering the vicissitudes of transmission of the 
jataka texts, it is uncertain wnether we have witness here to a trait 
of the original Pali language of the story, or of the Old Sinhalese 
into which it was translated, or of the rather late Pali of Ceylon 
into which the Old Sinhalese was finally translated (possibly 
influenced by the vernacular of the translator; see Winternitz 1933, 
117). All that we can be sure of is that the trait which was found 
in epic and artless story literature in Sanskrit, survived into the 
Buddhist literature, whatever the language involved. 


Prakrit usages are not easily found (E. g. I have not succeeded 
in finding any in Jacobi’s reader). However, the existence of such 
usage, at least for reference, is guaranted by the use in dramatic 
Prakrit of bhatta (oblique stem bhattin—) ‘master’ and bhattini- 
‘wife of the master’, in such a passage as the following from the 
7th century Ratnavali attributed to Harsa (Cappeller 1909, p. 335): 
uvaladdho kkhu mae bhattino vuttanto; ta java gadua bhattinie 
nivedemi. ‘I must remember that I have received news from master; 
so I will just go and tell master’s wife’. 


Anexample from an older stage of a modern Indo-Aryan 
language is found in an Old Marathi text (Lilacaritra, 1278 A. D.; 
Tulpule 1960, p. 97, text 8, 1. 12): tegé brahmané mhanitalé apuliyé 
brahmanité ‘the brahman said to his wife’; the noun stems are 
Sanskrit, to be sure, but this does not vitiate the value of the 
example. 
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Though the Modern Indo-Aryan languages in general have 
these usages, examples are not to be found in the easily available 
material (E.g. teaching material in Hindi-Urdu, etc.)22. More tradi- 
tional texts undoubtedly contain examples, as does the Nepali 
VetalapaficavimSati (Riccardi 1971). This is a prose adaptation, 
chiefly by expansion, af the verse Vet: in Ksemendra’s Sanskrit Brhat- 
kathamafijari. Though Ksemendra’s style does not offer examples 
of, E. g., ‘husband’ and ‘wife’ denoted by brahmana- and brahmani-, 
the Nepali translation does so. This, of course, occurs in places 
where Jambhaladatta’s and Sivadasa’s prose versions show this trait 
(E. g. Jambhaladatta p. 28, lines 1 f., Sivadasa p. 13, lines 4 ff., 
Nepali p. 59, lines 7f.). But, that there is no influence here from 
the Sanskrit prose versions, is made certain by similar occurrences 
in other passages in the Nepali translation, for which there is no 
parallel usage in any of the Sanskrit versions. One example is 
enough to guarantee this independence of usage, viz. p. 92, para. 
beginning taba, line 8 (translation, p. 162, line 1): bato-ma brahmani 
mari. ‘on the way my wife (brahmani) died’. Such usage of caste 


designations is then a trait of at least literary style in Nepali, and 
probably also of the colloquial. 


In Bhojpuri (Bihat), a story given by Tiwari (1960, pp. 233 ff.) 
identifies and refers to a brahman and his wife as pandit and 
panditain. This is a recording of an oral tale, and guarantees 
colloquial usage. It is to be noted that only reference usage, and not 
that of vocatives has turned up in the Nepali and the Bhojpuri texts. 
Even though vocative usages are well-known to occur in modern 
Indo-Aryan in general, the reciprocal usage between husband and 
wife is uncertain. 

On the Dravidian side evidence is on the whole hard to find 
because of the lack of text material with the proper contexts. 


In several Toda stories the proper context appears, E.g. a story 
of the creation of Todas, Kotas, and Kurumbas, in which, as would 
be expected, the males and their wives are identified throughout by 
the caste designations. Similarly in a story of the relations ofa 
Toda man and his Badaga jajmani clients, he is referred to through- 
out as 0.1 ‘Toda man’ and the Badagas as ma.f ‘Badaga man’. Several 
stories also have o.] and toz, mox in the meanings ‘husband’ and 
‘wife’, when referring to Todas, and others have ma.f and madty 
as ‘husband’ and ‘wife’, when referring to Badagas. Informants said 
that the vocatives used by husband and wife to one another in 
ordinary prose usage are o.lya. ‘O husband!’ and toz, moxya. 
‘O wife!’, i.e. with caste reference; 23 unfortunately, the prose texts 
that I recorded have no examples to guarantee this. 
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My Kota and Kodagu texts have no examples of the kind we 
are interested in, except for (Emeneau 1946, vol. 2, p. 109, sentence 
86) ko.ka.1 ko.v ko.ty elm ‘all the [Kota] men and [Kota] women of 
the Kota village’, which shows redundancy. There are few relevant 
contexts. Itis clear, however, that in neither of these communities 
do husbands and wives use vocatives to one another with redundant 
caste designation. 


Very little has been found in the corpora of Central Dravidian 
texts (which are not very extensive; for some of the languages there 
are no texts). 


For Parji, as has been indicated in note 12, no pairs of male 
and female caste designations have been recorded. In one text 
(Burrow and Bhattacharya 1953, pp. 96 ff) the village priest 
(designated as pelac) and his wife recur as actors in a ceremony, 
and one might expect (since this involves status, if not caste, 
differentiation) a one-word Gerivative term for the female member 
of the pair; but the terms are pelac and pelac ayal ‘wife (ayal) of 
the pelac’. These terms are used throughout the long account of 
the ceremony, rather than merely anaphoric ‘he’ and ‘she’; to this 
extent we have a parallel to the usages we are investigating. The 
vocative for the female term, as used by other villagers, is: ale 
pelac ayal (also: o pelac ayal), This too is a context where a one- 
word derivative term for the female would be expected if the 
language had such terms. 


Pengo instances have been discussed in note 12. In text 17 
(Burrow and Bhattacharya 1970), a story about non-Pengo characters 
brahman, herdsman, and gardener, and the wives of the latter two, 
an Indo-Aryan pair of terms for the gardener and his wife is found, 
E.g. in sentence 30 (p. 182) mari dr malni ‘the gardener and his wife’. 
This, of course, illustrates the borrowing of this trait from Oriya 
into this Dravidian language. The use of a phrase instead of a 
unitary term for the herdsman’s wife-she is referred to as gavrar 
ar ‘wife (ar) of the herdsman (gavren), gavrar being an ‘Oriya 
genitive but af a Dravidian word (DEDS 342)-parallels the usage 
just seen in Parji. The whole passage is symptomatic of probabie 
diffusion directions in this part of Central India, with its borrowing 
of only some Oriya items of the trait. 


. Gondi, on the other hand shows very clear instances of the 
trait, even in the rather unsatisfactory texts in Trench’s account. 
almost the only usable text collection that we have for this language. 
In the grammar volume (Trench 1919), p. 54, ina short narrative a 
herdsman and his wife are referred to as gaiki and gaiké respectively? 


7 a : 
= 
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The second volume (Trench 1921), p. 52, in an account of a Gond ma 
rriage, has the phrase: kumma-kummena...kal karsikum ‘having fallen 
in obeisance at the feet of the potter kummal) and his wife (kummé 
‘wife of potter’), and, p. 53, in a song addressed to pots and dishes; 
kummal babo!, kumme awwal. ‘(your) father was a potter, (your) 
mother was the potter's wife’ Inthe same volume, pp. 3 ff., the 
medium (diviner or shaman), whose status within the community is 
denoted by special terms for himself and wife is called bhimkal or 
bhak / gtal and the wife is bhak / g1@.2 A phrase in the story is: 
bhagte bhagtan . narmsuhchikum ‘the wife of the medium having 
made her husband sleep’; the terms are used in reference through- 
out the story instead of anaphoric forms. No vocatives of these 
terms appear in texts recorded by Trench, either as used by members 
of other castes or by husband and wife reciprocally; in the latter 
case, there are some instances of the use of terms meaning merely 
‘husband’ and ‘wife’ without redundant specification of caste or 
Status. However, an instance of the vocative of a caste designation 
for a female has been found in Elwin 1947, p. 113 bottom, 
in a song sung during marriage celebrations. A procession 
goes to the cowherd’s house and sings to his wife to come out and 
dig earth, addressing her as kope (i. ¢. kope; for this form, see below), 
i. e. ‘cowherd’s wife’ (rather than ‘Kopa [i.e. cowherd] girl’, as 
Elwin translates it). 

The pairs of caste designations in Gondi are well represented 
in the recorded material, E. g.in Burrow and Bhattacharya 1960- 
Especially instructive are those pairs in which the form for the male 
ends in -al (Trench -al) and that for the female in -e (Trench -€>. 
Some are adaptations of Indo-Aryan terms, E. g. Voc. 769 kummal 
‘potter’, fem. kumme, given above (as well as gaiki ‘herdsman’, fem. 
gaiké, note 24). But both endings are Dravidian,27 and there are 
two good examples in which the words as a whole are Dravidian. Voc. 
975 kGpal ‘herdsman’, fem. kdpe, belongs etymologically with the 
verb kip- ‘to herd (cattle)’, which is recorded for Parji. Gadba, 
and Kui, and the verb koh- id. of Gondi itself (DED and DEDS 
1838) whatever the relation of these forms may be to the verb ka- 
in DED and DEDS 1192.) Another such pair is Voc. 2184 panai 
‘Pardhan man’ (i.e. man of aGond sub-caste whose profession is 
singing the Gond legands and entertaining; Hivale 1946), fem. pane; 
etymologically this designation goes with the words for ‘song’ and 
‘sing’ in DED and DEDS 3351 (*pan, *pan). 

Dravidian usage, then scantily recorded as it is, but taken as 


a whole, parallels completely the equally scantily recorded Indo- 
Aryan material. It might be possible to say that similar social 
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structures on the two sides independently produced similar lingui- 
stic results, But, since the social structure at issue iS pan— 
Indic, i.e. in the favorite term, an areal ‘universal’, and is, 
moreover, historically of one origin, there is a great temptation 
to regard the similar linguistic traits, specific as they are in 
s0 much detail as also an areal ‘universal’, a typological feature 
which is likely, given more evidence, to turn out to be pan-Indic. 
If so, it will be impossible to avoid looking for diffusion mechanisms 
and history. The mechanism can only be bilingualism, whatever 
historical construct may be postulated to add greater plausibility to 
this tag. Since the Sanskrit epic has very good examples of all the 
varieties of usage, we Will have to produce a historical construct 
that allows of this early appearance. Our best other evidence for 
all the varieties of usage isin Toda, a language which is found in 
the far southern backwoods, and which may well have been there 
already two millennia ago (Emeneau 1957, p. 50), i.e. roughly in the 
period of the formation of the Mahabharata text as we now have 
it. The implication may well be that the Dravidian usage is that 
old or older, and that the Sanskrit linguistic usage was derived, 
along with the social structure, from Dravidian sources-a conclusion 
that we approached earlier in the examination of the vocabulary 
material alone, perhaps as notably as any in that which has to do 
with labels of habitation units (the Sanskrit borrowings from 
Dravidian in particular) However, it is necessary to emphasize 


again that the recorded material is so scanty that no sure conclusions 
can yet be drawn. Certainly, the Central Dravidian material from 
the backwoods areas of Central India where the aboriginal socia} 
structure is tribal rather than caste, looks like comparatively recent 
influence from Indo-Aryan, even in Gondi where the derivative 
structure of feminine forms is of Dravidian origin. We are almost 
forced to a hypothesis that the Dravidians whom the Indo-Aryan 
invaders met in the riverine plains of North India had a caste system 
with linguistic traits mirroring it, a system which they shared with 
the Dravidians of the plains of the south, while the tribal areas of 
the center had not yet been reached by this stage of social organiza- 
tion (which had, to be sure, developed from the stage which survived 
among them), This is hazardous, but indeed our scraps of evidence 
begin to look this way, 


Footnotes 


“This paper was presented in outline to the Linguistics Group, 
Berkeley, in the spring of 1973. It has benefited by the discussion 
then and since then of my colleagues. I should thank especially 
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Mrs. Usha Jain, who helped me with information about Hindi-Urdu 
usage and translation of passages from Ramchandra Varma’s Acchi 
hindi. All opinions expressed are my responsibility, and no error 
should be laid at the door of my colleagues. 


Notes 


1 Perhaps a good parallel is the replacement of n by min such 
English borrowings from French as pilgrim, vellum, grogram, random, 


* There is no statement or even mention of the usage in Whitney 
(1889), Thumb-Hauschild (1958-9), Mayrhofer (1956), Wackernagel- 
Debrunner (so far; 1896-1964), or Turner (1966). 


3 Edgerton 1924, p. 27, critical note to book 1, sent. 58 


4The data on which my statements are based are found in 
Navalkar 1880 (§§ 120, 123, 295, 532), Master 1964 (pp. 99-100, 
§ 186 [usage 5], pp. 102 ff, §§ 190-1 [usage 4]), Tulpule 1960 (usage 4 
is common in the texts; for variation between jari and jarhi, p. 204). 
Navalkar (§§120, 123) on usage 4 says that it denotes ‘definiteness 
or totality’ (his glosses are of the type ‘both’, ‘all the four’), and he 
specifies that the referends ‘have been mentioned in the previous part 
of discourse’, or ‘constitute the whole number’. Master (§ 190) takes 
exception to Navalkar’s statement as referring to Old Marathi 
(whereas, in fact, Navalkar is describing modern Marathi), but 
Master must be wrong in being unwilling to accept the Old Marathi 
usage, since close examination of his examples and Tulpule’s show 
that they are used in contexts as defined by Navalkar (which are 
of course the contexts of api in this usage in Sanskrit). 

5 There is no evidence to be found in Tagare 1948. 

6 Toda elom is derived from *ella-m (cf. Tamil ellam). This 
element om is analogically added to e.d ‘two’ and mu.d ‘three’, E.g. 
om e.dom ‘both of us (inclusive)’, em e.dom ‘both of us (exclusive)’, 
o (m) mu.dom “all three of us (inclusive)’. 


Notes 

7Kota -dan is cognate with Toda -@on/-don ‘only, just’ 
(= Sanskrit eva), and of course with Tamil tan (@) id. ; they belong 
in DED 2612. 

8For loss of final -m in Tulu, cf, mara ‘tree’ (*maram), the 
3d sg. neuter future suffix -u (= Tamil -um), etc. 


9Kui enga ‘and’ belongs in DED 35ic, with Telugu ika, iga 
‘hereafter, henceforth, still farther, yet’, to which should be added 
also Kolami (SR) inka ‘and’. 
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10This cannot have anything to do with Sanskrit va ‘or’, since 
this is not represented by any descendant form in NIA. 


Singer (1972, p. 323): ‘‘Compartmentalization of cultural 
differences .. isan old, well-known feature of Indian life, Indians 
in fact show such great sensitivity to cultural differentiation based 
on caste, language, geographical region, religion, and life-style and 
have so often categorized these differences with verbal and nonverbal 
markers that they sound like anthropologists when talking about each 
other.” But the ramifications of the use of verbal markers have, so 
far as I can find, not been investigated in detail. 


Notes 


12 There is not evidence easily available for all the languages 
of all three families, or even for many of them. However, for Indo- 
Aryan there are many items recorded in CDJAL. For Dravidian 
there is similarly material In DED and in Shanmugam 1971, pp- 
115-23. For Munda, one may find much Bonda evidence in Bhatta- 
charya 1968. 


Not all the languages for which we have extensive vocabularies 
have in the record paired designations of male and female caste 
members. E. g. no forms denoting females are recorded for the Parji 
language (Burrow and Bhattacharya 1953). It is possible that this 
negative merely means a failure to ask for such forms or to record 
texts with a relevant context, In text I, sentences 18 ff., one might 
expect to find the corresponding feminine for laman ‘man of the 
Banjara caste’, but there is insistence on the woman’s being old and 
consequently the text contains several instances of the phrase laman 
murtal ‘a Banjara old woman’, with feminine indication restricted 
to the word murtal ‘old woman’, 


On the other hand, the Pengo material (Burrow and Bhatta- 
charya 1970) is ambivalent on this point. In anumber of contexts 
and occasionally in the vocabulary, it is clear that there is no unitary 
designation for the wife of a member of a particular caste. E.g. 
p. 181, text 17, sentence 20 has gavrar ar ‘wife (ar) of a herdsman 
(gavren)’, the form gavrar being an Oriya genitive (p. 41, §80), and 
so throughout the remainder of the story; similarly in the examples 
given on p. 41, §80. This is the more conspicuous in that in text 
17, sentence 20 this type of phrase occurs in the larger phrase 
malni gavrar ar ‘the wife of a gardener and the wife of a herdsman’, 
in which malni is the feminine counterpart for mari/mali ‘gardener’, 
the two occurring together in sentence 30 (p. 182) mari ar maini ‘the 
gardener and his wife’; a similar pair is téli ‘oilman’, fem. télni. All 
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these caste designations in Pengo are of Indo-Aryan origin, and it is 
Surprising that the female designations are borrowed in some cases 
but not in others. 


We might suggest that the Dravidian speakers of Central India, 
including Parji and Pengo, lack such pairs of caste designations 
except as they have borrowings from Indo-Aryan, because they are 
‘tribal’ rather than members-of a caste system. This hypothesis can 
hardly have validity, when one compares the findings of Shanmugam 
1971 with the material in Kui (Winfield 1928, p. 20), as well asin 
Gondi, Konda, and Kolami. Some of the formations are Dravidian, 
some Indo-Aryan, whether caste members or tribal members are 
being designated. 


Deficiencies in the record make it uncertain how much of 
this type of material Kurukh and Malto have. The Kurukh diction- 
ary (Grignard 1924b) has a number of items denoting male members 
of castes (Indo-Aryan), with no female counterparts recorded. 


Droese’s account of the Maltos (1884, pp. iii-iv) says that they in 
their backwoods were altogether untouched by the caste system, and 
he records no caste names. In each language, however, there is 
recorded:a pair of designations for the language community, in 


Kurukh the pair kiirkhas : ktire khni, in Malto the pair male: malni. 
It is to be suspected that more really exists, since the feminine suffix 
ni is Indo-Aryan in origin and these words must be based on members 
of Indo-Aryan pairs. 

13Though among the Kotas of the Nilgiris the women are the 
potters, the caste designations (male ko-v, female ko‘ty) are undoub- 
tedly derived historically from a situation in which the men were the 
potters (ko‘v), and ko‘ty meant ‘wife of the potter’; cf. the items 
from the non-Nilgiri languages in DED 1468. 


14Wackernagal-Debrunner 1954, p. 280, §164b, note, records 
that Theodor Benfey (Benfey 1862, pp. 269, 271 f.) stated the basic 
meaning .‘Grundbedeutung’) of feminine formations in -ani-to ‘Fraue 
be [ines] Mannes’; but this is the meaning of all feminines of this 
category, whatever the morphological formation. Benfey’s examples 
were especially proper names such as indrani ‘wife of Indra’ and 
Purukutsani ‘wife of Purukutsa’, and matulani ‘wife of maternal 
uncle (matula)’. 


15See also Mandelbaum 1970, pp. 339-41. 

16The Toda ranking is as I have given it, and in that order: 
Todas, Badagas, Kotas, Kurumbas. The songs (Emeneau 1971 a) 
have much evidence. Particularly good is the set-piece or theme 
seen in the four song-units (two couplets of units): 6w xwi'r olm 
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‘the Todas of all the many (lit. seven) divisions of the clans’, 6w u'r 
ma‘fm ‘the Badagas of all the many (lit.seven) villages’, Gw xwi- 
ko'lm ‘all the seven Kota villages’, Ow muts xurbm ‘the Kurumbas 
in all the many (lit.seven) vilages’. These occur (in several case 
variations) in this order in the passages: 54.13-4, 66.29-31, 112.31 -2, 
114.23-4, 126.29-30. 166.17-9. 


17Sanskrit pakkana-, with DED, DEDS 3332 (Ta. pakkam, 
Te. -paka); cf. Burrow 1948, p. 382, and Mayrhofer, s. v. Sanskrit 
palli-, with DED, DEDS, DEN 3309 (Ta., Ka., Te., etc. palli); cf. 
Mayrhofer, s. v. pally1, with literature quoted beginning with Kittel. 
Nepali khopro and other NIA forms without caste reference CDJAL 
3938 *khoppa-, *khompa ): DED, DEDS 1441 (Ta. kuppam, Ka. 
Tu. koppa, Te, kuppamu). —The quotation of Marathi pada in 
DED 3347, along with that of Sanskrit pataka- ‘a kind of village, 
half a village’, implies that these are borrowed from Dravidian. This 
was the view of Burrow (1948, p. 383), and he was followed by 
Turner in CDIAL, entry 8031, with addition of Sindhi paro, Hindi 
para ‘quarter of a town’, presumably ‘quarter inhabited by a parti- 
cular caste’, to which the Marathi item should be added. The 
Sanskrit word is lexical (though it is also found in an inscription as 
the final element in a village name). When Hemacandra (11th 
century) gives meanings gramardha- ‘half of a village’ and gramai- 
kadeSa- ‘section of a village’, he undoubtedly intends what is 
intended by the lexicographers of the modern languages, viz. ‘village 
quarter inhabited by a particular caste’. This being so, Burrow 1972 
(p. 540) seems unwise in insisting on the wording ‘half’ and finding 
for this medieval lexical word Russian cognates (‘half, moiety’) with 
invocation of Fortunatov’s law (with complication caused by the 
Sanskrit long vowel). The Dravidian origin of this group of Indo- 
Aryan words should stand, with the qualification, of course, that 
Tamil and Malayalam patakam are reverse borrowings from the 
Sanskrit lexicon (note the meaning ‘section of a village’; ‘street’ also 
implies a caste-segregated area). 


18On the sita caste, see Winternitz 1927, p. 315, n. 1. 


19But the reading is uncertain, since the southern mas. have 
sa instead of te (i. e. that ksatriya), and the northern mas., except 
for K[asmiri] mss. and a few of the Djevanagari] mss. have te’ 
ntahpure ksatriya samn:. 


20The text is somewhat uncertain, but all the variant forms 
recorded have the vocative brahman; a vocative, whether of form 
brahmana or brahman, is certain. | 
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21This word is in all the variant forms of the. verse. 


22 Some material undoubtedly is to be found in the Lingui- 
stic Survey of India. At random I hava found in vol. IX. 1 (1916), 
p. 700, a Panjabi story about a man of Jat caste, who says in his 
conversation with a fakir: méri jatani ‘my wife’; the narrative is 
carried on with such sentences as: jatani jatnai kahddi. ‘the Jat’s 
a wife used to say to her husband (list. to the Jat)’. 


?3Emeneau 1941, p. 178. The song usage is quite different 
and is differentiated according to clan membership for males. 


24Not in Voc. nor in Trench’s vocabulary. Cf. Kolami 
gaykiak, (SR) gayki; fem. gaykiral. Marathi gaykya is the origin 
of the Gondi and Kolami words; it is not in CD/AL, but is connected 
with Marathi and Hindi gai ‘cow’, etc (CD/JAL 4147.3, Sanskrit 
gavi-). 

25This pair seems to be recorded only in Trench; not in Voc. 
The variation between k and g represents Trench’s varying spelling. 
Etymological notes are as follows. (1) bhimkal is the shaman 
(‘medium’) probably connected with the bhim, the earth-god, of the 
clan (for which see Grigson 1949, chapter X, pp. 193 ff., though his 
word for the medium, p. 334, is leski [cf. Voc. 3135, DEDS 432}). 
Whether the stem bhimk- is derived from an Indo-Aryan (e. g. 
Hindi) bhimik is still uncertain, even though Platts records bhimik 
as equivalent to bhim(i)ya ‘landlord; village god’. (2) bhag/ktal 
< Hindi bhagat ‘holy man’. Trench consistantly writes t instead of 
t; is this his error, or did his informants really use t? (I am indebted 
to Burrow for his suggestion on the etymology of this word.) On 
the suffix -al in both (1) and (2), seen. 27. 


26These quoted phrases omit marks for vowel length. They 
can be reconstituted, as Trench should have written them according 
to his best practice, with fair certainty as follows: kumma (? 4)- 
kumména ..kal karsikun; kummal babol, kummé awwal; bhagté 
bhagtan narmsuhcikun. This of course not really good evidence 
for the actual phonetics, and not all the long vowels are vouched 
for by other recordings, e. g. € in kummé, bhagté, and gaiké above; 
vos Soa ReaD 


27The masculine -al is so far without parallel in other 
Dravidian languages (see Shanmugam 1971, chapter 2 Gender-number, 
pp. 30-140), nor does it have an Indo-Aryan congener. This suffix 
was recorded by Burrow and Bhattacharya with short ain Voc. 769 
kummal, 975 képal, 2184 panal, etc., and with long a by Trench 
(1921), Williamson (1890), Phailbus (1925), and Subrahmanyam 
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(1968, pp. 31 f., 113 f.); the discrepancy is so far unexplained. It is 
not to be expected that the 1 of the suffix could be Dr. *] (so Burrow 
and Bhattacharya 1960, p. 75 bottom; but otherwise Zvelebil 1970, 
p. 140, § 1. 31. 1.5); ie. derivation of the suffix from *a] ‘man’ 
(DED 342) is phonologically difficult, but still it cannot be completely 
ruled out, so far as we can now judge. 


The feminine -e is recorded with short vowel by Burrow and 
Bhattacharya 1960, but with longé by others including Subrah- 
manyam (i968 a, Adilabad vocabulary, s.v. kumme:, candke:, 
varte:); again there is no solution of the discrepancy. A parallel has 
been recorded in Shanmugam 1971, p, 89, for Naiki of Chanda: 
divre, corresponding to dhivrak ‘fisherman’ (from Bhattacharya 
1961, p. 112; Indo-Aryan, see CD/AL 6819, Sanskrit dhivara—). What 
other CDr. languages might have the suffix is still unknown. There 
can be no Indo-Aryan origin for the suffix. A Dravidian feminine 
suffix *-ay was posited by Shanmugam 1971, pp. 121 f., with some 
doubt. He did not take into account the Gondi and Naiki (Chanda) 
occurrences, and there can be little doubt that he was proceeding in 
the right direction. For the phonology, add this to Zvelebil 1970, 
p. 72 Sl. 17. 2.2, with the additional evidence of the negative forms 
in DED 2106 *cillay (Ta. illai, etc.): Go. (Voe 3414) sille, hille, etc. 


28The partial phonetic similarity of Sanskrit gdpa- and 
gopala— to Go. kdpai should not tempt us to find any historical 
connection. The verb forms are primary in Dravidian, rather than 
the noun. Moreover, borrowing by CDr. from Sanskrit would 
probably have to be much too early to be credible; in Prakrit -p- 
has become -v-, and there are no -p- forms in any succeeding Indo- 
Aryan stage, except for borrowings from the literary language of 
the god-name Gopala (= Krsna) and of the designations of his gdpa 
and gopi devotees (cf. CDIAL 4293 gopald- and 4289 gip4-). 


International Journal of Dravidian Linguistics 


THE GENESIS OF A LINGUISTIC AREA* 
F. B. J. Kuiper 


1. A German scholar of a formermer generation once 
remarked that there can be no more important task for the Sanskrit 
philologist than to describe the changes that have taken place, in 
the course of the ages, in the mentality of the inhabitants of India.1 
What he referred to was the slow but steady cultural process of 
Indianization of those Aryan tribes who had once, in a prehistoric 
period, invaded India from Iran. The same thing might be said of 
the Indic languages. Here, too, there has beena slow process of 
Indianization, which brought Indo-Aryan, the language of the 
invaders, more and more into harmony with the languages of the 
indigenous families, in particular Dravidian and Munda. 


As far as modern Indo-Aryan languages are concerned, this 
fact was recognized about a century ago by Caldwell, the founder of 
Dravidian comparative linguistics. When he remarked, in 1875, 
“that the direction into which those vernaculars have been differenti- 
ated from Sanskrit has to a considerable extent been non-Aryan’’,2 
he was among the first to state, implicitly, the existence of an Indic 
linguistic area. 


Although similar phenomena were incidentally pointed out in 
Sanskrit, the remarkably classicistic attitude of most Sanskrit philo- 
logists has seriously hampered a correct interpretation of the earliest 
developments of Indo-Aryan, in the period when it began to develop 
independently, in complete isolation from its Iranian cognate. It is 


*Text of the Collitz lecture delivered at the summer meeting of the Linguistic 
Socity of America at Ann Arbor, July 30, 1965. I am greatly indebted to 
Professor Hans Kurath, who was so kind asto read the typescript and to 
suggest many corrections in the English text. (Reprint) 


1 See Hermann Oldenberg. Vedaforschung (1905), p. 53. 

2 Caldwell, A Comparative Grammar of the Dravidian or South Indian Family 
of Languages, 2nd ed, (1875), I, p. 57, 3rd ed, (London, 1913; Madras, 1961), 
p. 53. The ist ed, (London, 1856), p. 38 has a slightly different wording. 
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for this rerson that a historical account of this earliest stage of 
Indo-Aryan, which at the same time marked the beginnings of a 
process of convergence in the Indic area, may be useful. And such 
an account will necessarily have to start with the problem of the 
Indo-Aryan _retroflexes. 


2. It goes without saying that a problem on which so much has 
been written can only briefly be summarized here. As far back as 
1833 August Friedrich Pott pointed to the high frequency of the 
retroflexes in the autochthonous languages of India, which led him 
to assume that in Sanskrit ‘“‘a great multitude of words with such 
sounds have certainly been adopted from the autochthonous 
languages’. In the middle of the nineteenth century Caldwell, 
Morris, Benfey, and Ascoli affirmed the Dravidian origin of the 
Indo-Aryan retroflexes, but as early as 1864 Buhler controverted it.> 
In 1906 Konow® briefly referred to ‘‘a long discussion as to whether 
the Aryans have adopted the cerebral letters from the Dravidas or 
developed them independently”. His own conclusion is that “‘it is... 
quite possible that the Indo-Aryan cerebrals have been developed 
quite independent)y’’. On the other hand it seemed probable (in his 
opinion), that the Dravidian influence has ‘‘at least given strength to 
a tendency which can, itis true, have taken its origin among the 
Aryans themselves ’” These words bear witness to the impossibility, 
in the beginning of this century, of dealing with this problem 
methodically. It is characteristic that a few years earlier Konow, in 
a rather speculative article in which all conceivable traces of Dravidian 
influence on Indo-Aryan were enumerated, was still more cautious 
and pointed to a possible parallel in modern Norwegian, where a 
retroflex ¢ has developed from rt? 


Much later, in 1925. Jules Bloch stated that “‘there is nothing 
to justify the assertion that Indo-Aryan cerebrals are of indigenous 
origin’ §: but he changed his position five years later, admitting 
(like Know) that the action of the substratum, though of minor 
importance, ‘has helped to hasten and fix the results of an evolution 
anterior to the contact of both languages’’.® 


3 Etymologische Forschungen, 11 (1833), p. 88f., I11 (1836), p. 19; ef. p. 453. 

4 Caldwell, A Comparative Grammar...2nd ed.(1875),1,p.55; Il, p.32 with references. 

5 Wackernagel, Altind. Gramm., I (1896), p. 165, refers to Biihler, Madras literary 
Journal, 1864. 

6 Linguistic Survey of India, 1V (1906), p. 279. 

7 “Notes on Dravidian Philology”, Indian Antiquary, 32 (1903), p. 455. 

8 BSL, 25, p.6; English translation by P. Ch. Bagchi in Pre-Aryan and Pre. 
Dravidian in India (1929), p. 40. 

9 BSOS, 5 (1930), p. 733. Cf. L’ Indo-Aryen du Veda aux temps moderns (1934) 
p. 227 (“‘incontestable”’), ; 
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These words, which apparently pretend to indicate the exact 
dosage of a foreign influence in a prehistoric development, may 
strike us as curious, but they were no doubt prompted by the chrono- 
logical difficulty which Bloch must have felt at that time of admitting 
4 Dravidian influence in pre-Vedic times, Unfortunately, this 
difficulty has never been clearly stated in any article I know of. If 
it had been, the discussion could have become more substantial. 
The neglect of this aspect is one of the reasons why the progress 
made since 1864, when the controversy arose, is not material. It 
explains, to some extent at least, how it was possible that in 1956, 
when Emeneau, in his well-known article on “India as a linguistic 
area’’.19 accepted the substratum-influence and rightly stressed the 
important role of the bilinguals in this process, two other eminent 
scholars, without knowing his study, rejected any such approach. 
Renou, in his Histoire dela langue sanskrite,+* expressed his strong 
scepticism with regard to any traces of foreign influence in the Rigveda 
except for a few loan-words. He apparently considered the thesis 
of a Dravidian influence in the genesis of the Indo-Aryan retroflexes 
sufficiently refuted by the mere fact that the retroflexes already 
belonged to the Vedic phonemic system. Here the implications of 
the chronological argument are evident. Leumann in his review of 
Renou’s book12 expressed his approval and pointed out that all 
innovations in Sanskrit have been carried through with linguistic 
means inherited from proto-Indo-European. 


Since the notion of linguistic calque itself was not questioned, 
the point at issue apparently is, whether or not this should be 
restricted to the vocabulary. There is evidence enough to prove that 
it should not. Several students of Middle Indo-Aryan have been 
struck by the curious construction tassa agatakale (approximately 
‘“‘at the time of his having come’’), which occurs in a Pali text written 
about 500 A. D. in Ceylon.1* The construction, inconceivable in 
older Indo-Aryan, must be a mere calque, directly or indirectly, 
of the corresponding Tamil expression avan vanta polutu “when he 
had come’. In Sinhalese, the New-Indo-Aryan language spoken in 
Ceylon, the older ablative case ending was replaced in the tenth 
century by a morpheme which properly meant “standing in’. Again, 


10 Language, 32 (1956), p. 7. 
11 Historie de language sanskrite (1956), pp. 16, 29. 
12 Kratylos, 1 (1956), p. 156. 


13 Cf., e. g., Emeneau, Language, 30, p. 484, 32, p. 10. Hans Hendriksen, Syntax 
of the Infinite Verb-Forms of Pali (1944). p. 151 ff., tried to explain the construct - 
ion by analogy. Emeneau found also instances in Buddhist Hybrid Sanskrit. 
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Sinhalese gimahi sita ‘from the village’’ is obviously a mere calque 
of Tamil Grininru.14 Cases of this kind are not uncommon in the 


Indic linguistic area. 


To sum up, no agreement has yet been reached after a 
discussion extending over more than acentury. There are several 
points which require some clarification. The assumption of Dravidian 
influence requires strong arguments since borrowing of phonemes is, 
according to very experienced dialectologists, rare. Most discussions 
in the past were too abstract and tended to deal with generalities 
rather than with concrete facts. A telling example is the cavalier way 
in which such an eminent linguist as Otto Jespersen did away with 
the theory of Dravidian influence by a mere reference to modern 
Norwegian.15 Phonetic developments have been reconstructed but 
phonemes occur in words, and in this case it may be particularly 
interesting to determine just in which words retroflexes occur in the 
oldest extant text, viz. the Rigveda Finally, there is a problem 
of chronology which, for the very reason that it has never been 
explicitly stated, calls for clarification. 


3. It is necessary, therefore, to subject the entire Rigvedic 
evidence to a more detailed examination. 


First, there is the sibilant 5, which in Sanskrit ranks with the 
retroflexes.16 It is an inheritance from proto-Indo-Iranian, being 
an allophone of s after i, u, r, and, k, and before ¢, of the palatal 
sibilants §and z’, E. g. in ist4- and asta-. Besides, it occurs in a few 
words of apparently foreign origin, which raise the allophone to the 
rank of a separate phoneme; cf. casa'la-, jalasa-, asatara-.17 


The voiceless stop t occurs after retroflex s, as in ista’-, asta’ 
and in word-final position, to which [ shall return later. Besides, 
it is found in 17 words of evidently foreign origin, such as ki’kata-, 


14 See my note, Lingua, 8 (1959), p. 335, ina review of D. J. Wijayaratue, History 
of the Sinhalese Noun. 


15 Language, Its Nature, Development and Origin (1922), p. 197. 


16 The lateral 1, allophone of d, isa dialectal feature of the Rigveda only and 
will be left out of consideration. 


17 Cf. kavasa- VIT1, caga’la-vant-Ill1 (cf. casa’-I1), jélasa- 112 VII2 (vdsat 
Il1 VII3); dsatara-l1, ca’sa- X1, baskaya- 11. New words from the Atharva- 
Veda are: a’sa-, kalma’sa-, kaskasa-, jas4-, jalasa-, jaskamada, masmasa’- 
ma’sa-, yévasa-, 
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kévata-, biri'ta-, bekana'ta-, vatyrin-.18 Distinctive word pairs like 
kitas ‘“‘whence’ versus ktitas (perhaps a proper name) are rare. 
Some of these words used to be traced, or are still traced, to Indo- 
European. From the Rigvedic doublets kartd-and kdta- “hole’’ it 
was inferred that the cluster rt could, as the result of a phonetic 
development, incidentally become a retroflex t. This is, it is true,.a 
normal sound change in much later times but, apart from all etymo- 
logical considerations, in Vedic dentals were not affected by a _pre- 
ceding r'® and retroflexes were even clearly avoided in its 
neighbourhood. 2° 


The aspirated voiceless stop this very rare and occurs only 
in words of unknown origin.?! 


The voiced retroflex d occurs intervocalically, not only in the 
well-known Indo-European word for “nest” (Sanskrit nida’-from 
*nizd4-) and four Indo-European root morphemes, but also in three 
more roots, which are obviously non-Aryan.22, The twelve words in 


18 Cf. (rent-) kakata- VI1, ki’kata-Il1, kévata- V11, bi’rita- VII1; aratva- 
VIII1, bat VIIE1 (cf. bd/ ittha’, bala’), bekana’ta- VIIT1; aghati- X1, itdtah 
X1, kagwka-X1, kata-I1, kuta-ll, ku’ta-X1, kr’pita-X1, vatirin-I1, 
vikata- X1, Sakati'-X1; for Srausat I1, cf. vdgat. New words from the 
Atharva-Veda ere: (4ratu-parna-), arataki’-, avata- (X. 134. 6), ita-, vikanati’- 
(mukha-, cf. vaikankat4-), kita-, kurttin-, kitd-,kr’kata-, tiritin-, patara-, 
pataur4 (patira), pata’-, phat-, matmata-, lala’ta-, 8arkéta-, 8a’rkota- 

19 H. Reichelt, Stand und Aufgaben der Sprachwissenschaft (1924), p. 251. 

20 See Wackernagel, Altindische Grammatik, 1 (1896), p. 173. The theory im- 
plies (1) the occurrence of Prakritisms in the Veda, (2) weakened articulation 
of randr, although only a very few instances can b2 quoted for this weaken- 
ing, (3) an early retroflex articulation of t after r, which, however, is strongly 
contradicted by the evidence. Out of the Rigvedic words supposed to have 
-at- from -rf- only two deserve notice, viz. katuka- “pungent, bitter, biting”, 
which is traditionally connected with Lith. kartis “bitter’, and vikata-, a 
hapax of unknown meaning in book X, which is supposed to stand for *vikrta- 
(see however, Fs. Debrunner, p. 247 n. 21 and Fs. Kirfel, p. 171). Counterin- 
stances are, however, kitav4-, if standing for *krtav4- (Wackernagel), and 
Sithira-, if representing *rthird-, [vikata in the sense “monstrous” is later, 
of course, in current use.] 

21 Cf. jathéra- 31 pithinas- Vil; jathara- {1 , jathala- V1, “patharvan- 11 , 
irimbitha- X1 (cf. irimbithi- Anukr.) 

22 The different distribution of the inherited and the non-Aryan words over the older 
and more recent books is instructive. Cf. on the one hand nilayati VI, mrlati 
mr layati (passim), vildyari, vilitd- 113 W12 VI3; VIL and vi/u- W3 IV!, 
VI1; VIII6 ;16 X2, hélate (etc.) Ailita- 11 VII VIIS, etc,, on the other 
hand kri‘lati 1V1 V2; VIII! [X12; 11 X4, krila-, krilt-, krild-; krilumént- 
1X1: 13 X4, pipilé 1V1, kilayati VIM, nali‘- X1, Cf. also meli- II! IVI . 
New words in the Atharva- Veda are: kaphauda’- (v. I. -od4-, etc.) kroda-, 
khadii’ra-, jaigida-, ta’da- (? cf. Altind, Gramm., 11/2, p. 62 but Burrow- 
Emeneau, Drav, etym. Dict., Nr. 2466), (nadika'-) $a‘luda- (in VIIL.6.17 udu- 
mbélath tundélam uta’ $4’ludam), hridu- (vv. 11. htdu-, hridru-, rudu-). 
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which it stands aftera nasal, such as dandd-, must all have been 
borrowed from a foreign source.23 


The voiced aspirate dh occurs in some seven inherited words, 
where it has arisen from *zdh.24 In foreign words aspirate is rare: 
there is probably only one instance.?5 


Finally, the retroflex nasal, originally an allophone of the 
dental n after r and s became phonemic in a few words, aftera 
preceding voiced sibilant had disappeared, e.g. in dindsa- (from 
prehistoric *duzna’Sa-). Apart from these two or three words, it 
occurs in the cluster nd in the twelve foreign words just mentioned, 
and besides, intervocalically, in some thirty-five words2® of presu- 
mably foreign origin, such as gana-. Most puzzling are three inherited 
words, where it stands for dental nasal without any apparent 
reason.27 


4. From this account it follows that the role of the foreign 
words in the expansion or establishment of the new phonemes may 
have been particularly important. This raises the problem of the 


23 Cf. dandd- VII! , pinda- in hiranyapinddé- VI! , manditi’ka- VIL! TX! (i’X! ); 
martanda- II! X2, ra’ndya- (v. |. ra’ndrya-) VI). Sandika- Ill. §anda- Vil, 
akhandala- VIII!, andd- VIII2. J! X11, (cf. martanda-), kunda- (pa’yya-} 
VIiI1; pinda- 1) (cf. hiranyapindé-) kundrna’ci- 1. pundarika- X1, mand- 
tra- (dhaniki-) X!. New words in the Atharva- Veda are: urunda-, ka’nda-, 
canda-, tandula-, tundika-, tundéla-, dindana-, pandaga-, banda-, $and- 
adurva-(?), Sikhandin- (cf, ni’la-Sikhanda-), Were also occur the first words 
with nt(h): kantaka- (v. 1.), sahakanthika-. 

24 Cf. gii|hd-, tr]hd- drlhd-, ba]hd-, milha-, (4-)rilha-, (4-)3a]ha- (cf. sa’Jhr-). 

25 jalhu-. 

26 ani- V1; 12, (D48) oni- VI; Xt (Désonya- Khila oni- IX3;1! ) kalya’na- 
(-i-), TI! IV1 511 Xt, kunaru- W111 , gané- 39 and compounds (gdnya III! ), 
ninik IV1 (ninyd- IIL! IV1 VII3; 1X1; 13 X1), pani 50 punya-gandha- VII) 
phan-, apaniphanat IV! (Aphagnyat VIU1), band- VU, vanij- V1;11, vand- 
IV1; Il (the interpretation of the words vaN4é-and va’Ni-is problematic), §éNa- 
Wil? Vt; TXt;12 X!, The following words are characteristic of the eighth 
book: anulba) 4- VIII! (apparently different from anulban 4-X!) kd ava- kanyd- 
VII? (praskatva-VIII3 11), kanukaé-VIIM! , jatijana-bhdvant-Vi\ , nicumpunh & 
VIIFt ; a word of the ninth book is 4Qva- (-i-) 7X5; J1. Words not attested 
before the two last books (I and X) are: ogaha- X?, kKaNiikaydantih X1 kané- 
Xi, -dhaniki- X1! veNl- X11, sthanu- X1. For the reconstructed words 
*ambhaa-, *kuQdaga‘ci-, and alataha- see below, p. 88. Atharvavedic words 
are: eQi-, kajla- ki Qapa-, khaNvaka-, gua-, pratipha-, pratipana-, manr- (2), 
lavana-, $loNd-, (2 cf. Altind. Gramm., I p. 193, If2, p. 736) Cf. a’manako 
ma Qatsakah XX. 130.9. 


27 sthi’Na- “pillar” (Av. stina-) mani- “gem”, and paQi-“hand”. Cf. also Sana- 
“hemp” in the Atharva- Veda. 
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criteria for declaring a Rigvedic word as borrowed. Only in passing 
can this be mentioned here, as it is impossible to enter into technical 
details. Instead, a more general remark must suffice. It js still a 
common practice of etymological dictionaries to reject a non-Indo- 
European etymology for a Sanskrit word ’for no other reason than 
that the word is already attested in the Rigveda. Therefore, I should 
like to’ point out-although it is unnecessary to stress this point ina 
meeting of linguists-that adoptions can occur in aay language and 
at any time, and that it would be surprising if there were none in the 
Rigveda, Despite the general disinclination of Vedologists to accept 
the existence even of incidental non-Aryan loan-words in the Rigveda 
it does not seem quite realistic to look upon early Indo-Aryan as 
a language spokenin a vacuum. There must have been, from the 
earliest times, contacts between the Indo-Aryan invaders and the 
autochthonous population, and the possibility that some foreign 


words found their way from colloquial speech into the higher idiom 
of the poets can hardly be denied. 


There remains, it is true, the question as to which are the 
criteria for declaring a word non-Aryan. This is a technical problem, 
which cannot be dealt with here. Although an attempt has been 
made to sift out more or less methodicaliy the majority of the foreign 
elements in the Rivedic vocabulary, this has not yet been the subject 
of a general discussion. Nor could it possibly be otherwise because 
hardly any Vedologist is equipped for this kind of research. With the 
reserve, therefore, that all conclusions in this field must be considered 
provisional, it may be stated that in the oldest books of the Rigveda 
CII-VII) there occur about 35 words of foreign or doubtful origin 
which contain a retroflex. Next in the chronological order comes the 
eighth book, which stands somewhat apart because it shows traces ofa 
more profound non-Aryan influence. It has 12 new foreign words with 
retroflexes, three times as many as any of the older books had in- 
troduced.28 In the two last books (I and X) the newly introduced 
foreign words number 38. The total number for the whole Rigveda 
is approximately 90. 


The steady, though not spectacular, increase of words with 
retroflexes continues in the later and more popular Atharva-Veda, 
which has some 50 new borrowings. Very many of them are hapax 
legomena, but this very fact points to a general conclusion. If, indeed, 
in the priestly idiom of these hymns so many foreign hapax legomena 
occur, it may safely be inferred that in the contemporary colloquial 


~ 28 For retroflexes in this book see Karl Hoffmann, Worter und Sachen, 21 (1940), 
p. 149; for traces in book VIII of a non-Aryan civilization see An Austro-Asiatic 
Myth in the Rigveda ( = Meded. Kon. Ned. Akad. van Wetensch., N.R.13/7)(1950) 
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speech such occasional borrowings, probably restricted to very 
definite periods and areas, must have been much more frequent. It 
may be added that several non-Aryan names of Vedic theologians 
such as Kavasa Ailsa prove that members of autochthonous tribes 
had been adopted into the Aryan community. At that time the 
distinction between Aryan and non-Aryan apparently had a socio- 
religious, rather than an ethnic, character. 


Some linguistic reflexes of a socia! differentiation can perhaps 
be detected in traces of a puristic counter-movement found in the 
Rigveda. Iwill not stress this point because, again, this conclusion 
is based on an interpretation of certain facts which so far has not 
been the subject of a thorough discussion and cannot be said to have 
been generally accepted. In all the words concerned an r seems to 
have been secondarily inserted owing to the occurrence of a retroflex 
consonant in these words. 


By the side of the Rigvedic verb kiidayati (kilayati), which has no 
satisfactory Indo-European etymology, a later Vedic text has Aradayati 
with inserted r. For Rigvedic randya-, a hapax of unknown meaning and 
origin, Some manuscripts have the variant reading ra’ndrya-, with 
dental nd and insertion of r. Beside the word kata- ‘‘hole’’, which 
for etymological reasons seems to be the original form, there occurs 
a doublet karté-, in which r is likely to be secondary. For etymolo- 
gical reasons Rigvedic ambhrna-seems to be a hypercorrect form for 
ambhana-, which is attested in later texts. Also in kundrna’ci-and 
alatrna-the cluster rn has been explained as standing for an. In 
other hypercorrect forms the Rigveda has a dental for what in the 
later language is a retroflex. 


If this theory of a secondary insertion of rin foreign words 
with retroflexes is correct?9 -—and the corroborative evidence of 
similar cases in later Sanskrit is very strong-, these facts reflect 
different social levels, and a reluctance of higher classes to accept 
foreign words containing retroflex phonemes without some previous 
Sanskrilization. However, the procedure of legalizing them by the 
insertion Of anr is curious and poses problems of its own, which 
lie outside the line of this argument. 


29 Seee.g. Bloomfield-Edgerton, Vedic Variants, IT (1932) p. 296, H. Oertel, 
Festgabe-Jacobi (1926), p. 25, Zur Kur Kapisthala-Katha.Samhita ( — SBAW, 
1934), p 36, Kuiper, Festschrift Debrunner (1954) p. 242ff., Turner Felicitation 
Volume, I (— Indian Linguistics, 1958), p. 351 ff., Museum, 59 (1954), p. 120 
Festschrift Kirfel (1955), pp. 149, 177 ff. For garta-beside karta-see also Lokesh 
Chandra, Jaim, Br. IT, 1-80 (1950), p, 23, 
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5. So much for the facts. As far as their explanation is 
concerned, I must apologize for repeating a few things which others 
have stated before with perfect clearness. 


For the arguments which led to explain the new set of 
phonemes as the result of a so-called natural internal development 
of Indo-Aryan we have to go back to 1934. In that year Jules 
Bloch gave his last exposition of the problem3° and, although he 
was ready, then, to concede some subsidiary influence of the 
Dravidian phonemic system, he maintained that the retroflex stops 
(to which I shall have to confine myself here) have arisen in Indo- 
Aryan from three different sources. Since the present-day rejection 
of any Dravidian influence must be based on the same arguments, 
it is necessary to consider them more closely. I shall first discuss 
two of them. 

The least important and most problematic argument is based 
on the occurrence in the Rigveda of the two synonyms karté—and 
kata-, the last of which (with a retroflex t) is supposed to represent 
a phonetical development of the first. Since the theory of such 
a development has been contested and kartd- is one of the words in 
which the r can have been secondarily inserted, the argument is at 
best a doubtful one. Besides it accounts only for a few isolated words. 


The second, and most important, factor is the assimilation 
which is supposed to have changed the dental ¢ to a retroflex t after 
sin words like ista- and agtd-. It need hardly be pointed out 
nowadays that this phonetic explanation disregards the real problem. 
“The different articulatory positions of the final t in English bat and 
bashed’?! do not imply a phonemic contrart in English. Nor was 
there any such contrast between the ¢ in Avestan asta- ‘“‘messenger’’ 
and asta- “hurled, thrown’’, while there certainly was one in Vedic 
between the retroflex t in asta- and the dental tin asta- So the 
difference that exists in this respect between Old Iranian ista-‘“‘desired’”’ 
and Indo-Aryan istd-, far from explaining the emergence of the 
retroflex ¢, rather presupposes its existence as a separate phoneme. 
In Indo-Aryan, and only here, an allophone must have been raised 
to a phonemic status. 


Now, froma Pan-Indian point of view it is apparent that this 
prehistoric development of Indo-Aryan may be looked upon as a first 
step taken in the direction of a Pan-Indic convergence. If so, it is no 
longer possible to ignore the development that has taken place in a 
different linguistic family of India. It has often been pointed out 


30 L’Indo-Arycn du Veda aux temps modernes, pp. 53-57, 325. 
31 Turner, JRAS, 1924, p, 556. 
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that nowadays every language in India has a set of retroflex stops, 
and this is also true of the Munda family?2 -with the sole exception 
of one or two of the southern languages of this group. Their distri- 
bution in Munda is certainly not the same asin Dravidian, nor does 
Munda have the retoflex nasal of Dravidian, but their phonemic 
status is as firmly established here as it is in Indo-Aryan, Dravidian 
or Burushaski. Still, the comparison with other Austroasian languages 
points to the conclusion that they must be the result of diffusion 
throughout the Indian subcontinent—perhaps at an early date, as some 
Munda loan-words in the Veda might indicate. 


In later times the gradually growing importance of the retro- 
flexes in Indo-Aryan fits in very well with the general tendency 
towards convergence in the Indian area. This convergeuce is quite 
clear at a much later time, when the retroflex din Hindi developed, 
in postvocalic position, to same retroflex flapped sound that also 
exists, partly with the same origin, in North Munda. 


It may seem natural, then, to assume that in the same way, in 
prehistoric Indo-Aryan, bilingual speakers who recognized a phonemic 
contrast between dentals and retroflexes in the foreign language, 
came to interpret the allophones of proto-Indo-Aryan in terms of the 
foreign phonemic system. The loan-words with retroflexes which - 
at least in my interpretation of the Rigvedic evidence-they must have 
introduced into Indo-Aryan may have contributed considerably to 
the spread of this novel phonemic distinction among the speakers of 
early Indo-Aryan. Emeneau clearly formulated this conclusion in 
1956 when he stated that pre-Indo-Aryan and pre-Dravidian bilingu- 
alism provided the conditions which allowed  pre-Indo-Aryan 
allophones to be redistributed as retroflex phonemes.33 


Most linguists will probably agree with Emeneau that the 
correctness of this explanation is, as he put it, ‘beyond doubt’. 
Nevertheless, some difficulties remain, 


First, the assumption of such a foreign influence conflicis 
fundamentally with the traditional picture of a purely Aryan society. 
Although this picture has been defended with modern arguments it 
is basically an inheritance of a romantic and idealizing attitude of 
19th- century scholars, which can hardiy be considered relevant. Still 
amore ample demonstration of Emeneau’s thesis would seem 
desirable. 


32 See H. Pinnow, Versuch einer  historischen Lautlehre der Kharia-Sprache 
(1959), p. 289 ff. 

33. Language, 32 (1956), p. 7. Cf. my conclusion in India Antiqua (1947), p. 212; 
“as far back as the prehistoric period’’. 
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More important is the fact that we are here faced witha 
methodological dilemma. Indeed, historical linguistics shares some 
characteristics with the science of history in general. The historian 
interprets the past in the light, and with the knowledge, of what 
happened afterwards. This is what we are doing when, in the light of 
later developments, we interpret pre Vedic developments as the first 
Steps taken in the direction of a convergent evolution. However, 
how far are we entitled to trace back the line of this evolution into 
prehistory? If it may be taken for granted that convergence was a 
real factor in the development of Indo-Aryan of, say, five centuries 
ago, have we a right to interpret in the same light phenomena of ten, 
or twenty, or thirty centuries ago? Can we entirely exclude the 
theoretical possibility that a development started as a fully auto- 
nomous process, only to become, at a much later time, a factor in the 
context of a general Pan-Indic evolution? In other words, can be 
sure that our interpretation of the pre-Vedic evolution is more than 
an unwarranted extension of the observer’s own perspective to periods 
that should be kept apart? 


To this basic dilemma of historical interpretation there is, as 
far as I can see, only one answer. Any such interpretation of an 
isolated phenomenon will to some extent remain problematical as 
long as we are unable to correlate it with similar interpretation of 
other, apparently contemporary, phenomena. 


There is a third difficulty. Apart from the position after s, 
as in ista’— ‘‘desired’’, the retroflex t occurs in some noun and verb 
forms?4 in word-final position, e.g. in ra’/t, gen. rajah ‘‘king’’, 
which corresponds to Latin réx, regis. Now, many Indo-Euro- 
peanists think that this retroflex represents an older palatalized stop 
of proto-Indo-Iranian and must, there-fore, date back to a remote 
prehistory.35 If so, its early origin enables the adherents of this 
theory to argue that, oncea retroflex had arisen from this Indo- 
Iranian palatal stop or affricate, this could lead to a redistribution 
of allophones in such cases as ista-. 


I need not explain why this theory seems to me unacceptable, 
and why I think that the final retroflex is rather the result of the 
redistribution. What matters is that the theory exists and that the 
controversy which arose a century ago is bound to go on indefinitely. 


34 See below, p. 103ff. 
35 E.g., J. Bloch, BSOS, 5 (1939), p. 733, who however, while explaining 
sa’t “six” from *satg, disregards the fact that sodha’ “*six times’’ points to 


*sa$ + dha’. 
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unless decisive proof for the Dravidian “origin” of the Indo-Aryan 
retroflexes can be furnished. This decisive proof must consist in 
providing cumulative evidence for the view that the tendency of Indo- 
Aryan for conforming to Dravidian patterns actually dates back to 
prehistoric times. In the light of this evidence it will, I think, no 
longer be permissible to ignore the wider Pan-Indian perspective 
with regard to those innovations of Indo-Aryan that have striking 
parallels in non-Aryan languages of India. 


6. One of these parallel innovations, which curiously enough 
has never been mentioned in this connection,?6 is the use of Vedic 
iti ‘thus’. In order to make the character of this innovation clear, 
it may be convenient to start from Old Iranian. 


In Avestan there is a word uiti ‘‘thus’’, which can be used 
in the meaning ‘‘likewise’’ but mostly occurs before direct speech, 
in expressions like ‘“‘thus he said’’, ‘“‘saying thus’’. Sometimes these 
words are inserted into the quotation but they never follow after it. 


In the Rigveda there is a parallel formation iti, which occurs 
70 times. I leave aside ten passages where it simply means ‘‘thus, 
then, likewise’, like Latin ita.37 In the remaining 60 passages a verb 
of saying or thinking is expressed or may be understood. However, 
in striking contrast with the Avestan usage, the words “thus he said’”’ 
etc. stand only six times before the direct speech,38 but 30 times 
after it. Obviously there has been a shift.39 The resulting con- 
struction, although unknown in Old Iranian, would not seem 
impossible as Indo-Aryan sentence structure. A simple English 
equivalent may illustrate what has happened. A literal translation 
of the first type would be 


(1) Thus he said: “I shall come’’ (6 + 1) 
The second type, 


(2) “I shall come’, thus he said (30), 


while still being conceivable as an Indo-Aryan sentence, has at the 
same time become entirely identical with the only construction by 
which this can be expressed in Dravidian. Here, indeed, indirect 


36 A brief reference to this word can be found in /Z/, V (1961), p. 81. 

37 According to Geldner’s translation. Cf. e. g., X. 120.4a tti cid tva ... anuma’- 
danti “Ebenso jubeln ja dir...zu’’. Delbriick, Altind. Syntax (1888), p. 529, 
supposed a verb of saying to be implied in these passages. For the Rigvedic 
evidence see the Appendix. 

38 Also in later Sanskrit literature it rarely stands here. see Speyer, Vedische und 
Sanskrit Syniax (1896), p 93. 

39 Delbrtick, Altindische Syntax (1888), pp. 23, 531, Speyer, Ved und Skt S yntax 
(1896), p. 92ff. 
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speech is unknown, the only equivalent being direct speech, followed 
by a word which means “having said’’, e. g., Tamil enru Kannada 
endu, Telugu ani. The general structure of the Dravidian sentence 
makes it pretty certain that this has always been the genuine Dravidian 
equivalent for indirect speech. 


Nevertheless it is a drawback that we cannot compare the 
syntax of the Rigveda with contemporaneous Dravidian texts. The 
oldest Dravidian texts that we know are those in Old Tamil. They 
probably date from about the second century A.D. and are, 
accordingly, at leat a thousand years later than the Rigveda. An 
exact comparison with one of these Old Tamil classics, the Purananiru 
is now made possible by the recent publication of a word index of 
this text,4° which enables us to make the following statements. | 


First, although Tamil enru properly means “having said’’, it 
could be used in Old Tamil (as it isin modern Tamil) merely to 
mark off the preceding words as direct speech. As such it could be 
followed by a verb of saying. In such Old Tamil instances like enru 
collupu or entu palar kira*} the function of enru is not noticeably 
different from that of Sanskrit iti. The closest English translation 
would be ‘thus having said’, resp. ‘thus many having said’. In 
Old Tamil, however, enru was not the most frequent marker of 
direct speech. Far more common was the form ena, a different 
fromation from the same verbal root, which occurs after direct speech 
in more than 200 passages.42 Now, this function of the Old Dravidian 
ena, enru enables us to explain not only why in pre-Vedic times 
the words ‘“‘thus he said’? were put after the direct speech but also, 
and particularly, how thc word iti came to be syntactically separated 


40 See V. I. Subramoniam, Index of Putanaanuutu (University of Kerala, 1962) 

41 Putam 152.18, 278.3. My sincere thanks are due to Dr. Kamil Zvelebil, who 
kindly checked the majority of the passages in the Puranantru. He found that 
ena oceurs in Puram at least 200 times as a simple marker of direct speech, 
follow2d by such verbs of saying as nuval-, ktitu-, e.g. 279-10 tai ceyvinai 
mutitt.ena két pal “Is their (good) work finished’ thus I shall ask’’, 212.1 nmunko 
yar-ena vinavin “Whois your king?’ if you ask thus’, 239.6-7 valiyar-ena 
valimojiyalan, meliyar-ena mi-kkiralan “he will not praise (them) saying ‘they 
are strong’, he will not say with intensity ‘they are weak”. On the other hand 
there are only 26 occurrences of entu, mostly used in its full lexical meaning 
“having said”, and followed by verbs like ika]- “to reproach” é/tu- “‘to praise” 
but not by verbs of saying except for the two passages mentioned, viz. 152.18 
mak kol valam tammin entu collupu, kutuki “give the black stick to the right’, 
thus having said, having approached..., 278.3 cituvan...matinan enru palar kurta 
“The boy... has changed’ thus having proclaimed”. 

42 Cf. also, e. g. Tol. Col. 62 ena molipa “‘thus they say”. 
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from the following verb and to be connected more closely with the 
preceding words. 


That this actually has happened is apparent from the third 
construction, 


(3) He said “I shall come’’ thus (11), 


which occurs in eleven passages. Here the normal Indo-Aryan 
construction seems to be restored. The verbal form “he said’? precedes 
the words “I shall come’’, but iti ‘thus’? has now become a fully 
independent word that marks off the end of the direct speech. As 
such it has ever since remained in current use in Sanskrit and, as the 
fourth construction. 


(4) ‘“T shall come’’, thus (12), 


demonstrates, an additional verb was no longer necessary. Here we 
find the beginnings of the later practice in Sanskrit of expressing 
utterances or thoughts by direct speech followed by iti. In the Rigveda 
this had not become a fixed rule, and direct speech without fri still 
occurred by the side of the new construction.*s 


All these four constructions occur equally in the earlier and the 
later parts of the Rigveda. So the shift in the syntactical structure 
which resulted from the re-interpretation of */ shall come’’ -thus he 
said as “I shall come” thus—he said cannot be shown to have taken 
place in historical times, and the explanation here proposed necessarily 
implies the assumption that the inherited Indo-Aryan construction 
had been adapted to the Dravidian pattern in the prehistoric period. 
That this process can be considered the first manifestion of what was 
to become a general areal tendency is proved by Munda, 


Jules Bloch, curiously enough, denied the existence of a 
similar construction in Munda.4# Nevertheless, in several northern 
languages like Santali and Mundari, whose structure does not admit 
of indirect speech at all,45 the word mente (lit. ‘“‘by saying’) is used 
in quite the same way as enain Old Tamil and iti in Sanskrit. Ina 


43 Renou, Grammaire védique (1952), p. 392: ‘‘La situation normale reste le 
discours direct sans particule’’. Geldner in his notes to his translation often 
adds the remark “‘iti zu erganzen’’, e.g. VI. 16.26. This betrays an imperfect 
understanding of the Rigvedic construction and its historical development. 

44 See L’Indo-Aryen, p. 327. 


45 Cf. Hoffmann, Encyclopaedia Mundarica, p. 2817, A. Nottrott, Grammatik der 
Kohl-Sprache (1882), p, 89, Bodding, Materials for a Santali Grammar Il (1929). 


pp. 252, 269, Santa! Dictionary, 4 (1958), p. 280. [Hoffmann, Mund. Gr., pp 
Pye, 175, 211.) > PP, 


ijdl Vol. III No. 1 149 Kuiper 


southern Munda language like Sora the equivalent is gamle ‘having 
said’’.46 The circumstance that in North and South Munda different 
words, derived from the common verbs of saying, are used suggests 
the conclusion that this construction has been introduced in com- 
paratively recent times. Indeed, in an isolated northern language 
(Korku) and in Kharia it is, as far as I am aware, unknown.47 Some 
modern Indo-Aryan languages, like Marathi and Bengali, have 
developed similar constructions by using, instead of Sanskrit iti, 
derivatives of verbs of saying (cf. Mar. mhanin, Beng. boliya), 
parallel to those of Dravidian and Munda. They can be used ina 
way more or less comparable to that of Dravidian.48 Ever since the 
prehistoric period, accordingly, the tendency for adaptation to the 
Dravidian pattern has produced new formations and constructions 
in the Indian area. 


The question naturally arises whether the tendency towards 
convergence in a linguistic area is actually such a real and persistent 
factor that the evidence of modern Munda can be used in support 
of an interpretation of a prehistoric development in Indo-Aryan. A 
particular detail shows that it can. 


A specific use of enru in Tamil is that after onomatopoeas, as 
in pata-v-enru “‘with the sound pata pata’’ (lit. ‘‘saying pata pata’’). 
In Old Tamil ena is used in a similar way, e.g. tutum ena, kall ena, 
ilum ena.*9 Some Munda languages which have developed the 
construction of direct speech with mente or gamle imitate Dravidian 
also in using these words after onomatopoeas.59 Now, although in 
the Rigveda a similar use of iti is still unknown, in later Vedic texts 
the first instances are found of what was to become the normal 


46 SeeG. V. Ramamurti, 4 Manual of the Sora Language (1931), pp. 52, 149, Sora- 
English Dictionary (1938), p. 97: irte gamle anin opuninten (“I shall go saying he 
told me’’— “he said to me that he would go”’. 


47 See John Drake, A Grammar of the Kirt Language (1903), p. 136. 


48 See Jules Bloch, L’Indo-Aryen (1934), p. 327 f., and cf. Bloch, La formation de la 
langue mara the (1920), p. 272, Navalkar, The Student’s Marathi Grammar 
(1880), pp. 357, 667, Wilberforce-Bell, A Grammatical Treatise of the Marathi 
Language (1914), p. 39: mi tujhe ghari yein mhanun mhanala “he said he would 


come to my house”’, 


49 putam: tutum ena 243.9, kall ena 320.9. ilum ena 3.3, 93.1, 159.18, 176.5, 237.18 


399.33- 
50 E. g., Santali sat mente “suddenly” (cf. Skt. jhat iti), Sora rameyen mauyn-mauy 


gamle gutc ‘the cat cries maung maung”’. 
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practice in later Sanskrit.°! The Rigveda uses a cognate of iti in a 
slightly different way in 4] ittha’ “indeed, forsooth’.®2 So there is 
a strict parallelism between the Indo-Aryan innovations of the pre- 
Vedic and Vedic periods and those in Munda. The case of iti may 
be considered a classical example of how a language can adopt foreign 
syntactical constructions while retaining its inherited morphological 
elements. 


I think no apology is needed for this rather minute analysis 
because only in this way are we able to reconstruct to some extent 
what happened more than three thousand years ago. And this exact 
reconstruction is indispensable since, once it has been demonstrated 
that the use of a single word has been conformed to a foreign pattern 
in pre-Vedic times, this conclusion may also be valid for problems 
of wider importance. Such a problem is the origin of the gerunds 
in Sanskrit. 


7. In such constantly recuring phrases of the Rigveda as 
“Indra after killing the dragon let loose the waters’’ the poets use, to 
express “after killing”, beside the inherited perfect participle some 
indeclinable forms derived directly from the roots (the so-called 
gerunds). It is generally agreed that they are comparatively recent 
innovations of Indo-Aryan. Their distribution over the older and 
more recent books of the Rigveda points to the conclusion that their 
use was not yet firmly established. In two of the oldest books 
(IV and VI) they are practically absent whereas more thaa half 
of the occurrences are found in the two last books (I and X).58_ Still, 
they cannot have been created at such a late date as their occur- 
rences in the Rigveda might seem to suggest. 


Morphologically they are generally assumed to be old instru- 
mental forms-an assumption that is supported by parallel formations 
in North Munda. However, this explanation implies the assumption 
that they must have been created long before the oldest hymns of 
Rigveda were composed. Asa matter of fact, the Vedic form &rrva’ 
**having done”’ points to an old type of paradigmatic ablaut that has 
left no traces in the Rigveda. The contrast between the early origin 


51 In dahis te astu 53°] iti, AS, 1.3.1-9, KS. 13.9 (p. 191‘ 7), TS. TH1.3.10.2, varséQe- 
ksantu ba] iti AS XVII. 2,22 and dhig ity abhigatah, $a'l ity apa’krintah, pha'l 
ity abhisthitah AS XX 135.1 (Kunt&pa-hymmns). Cf. ghpaa iff SB XTV. 1.1.10. 

52. RS. I. 141.1, V. 67.1, 84.1, VI. 59.2. 

53. 127 IT'S Whe [V2 V5 Vis Vil4 VII? IXS XS8. The total number is 


143 according to the list occurring in Delbruck, Das altindische Verbum (i874), 
p. 228f. 
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of these forms and their late adoption into the archaic priestly poetry 
is apparently the result of a social differentiation. These gerunds 
must have emerged among lower social classes and must have been 
used in colloquial speech, long before they found acceptance in the 
highly traditional religious poetry. 


The introduction of these gerunds has led, especially in 
narrative prose of later Sanskrit, to the development of a type of 
sentence consisting of a long series of gerunds and ending with a 
single finite verb form Now, this is exactly the Dravidian sentence 
pattern. As a matter of fact, Dravidian admits of only one conjugated 
verbal form, at the end of the sentence. Other verbal forms, when 
referring to the same subject, are indeciinables, used in quite the 
same way as the Sanskrit gerunds. It is not surprising, therefore, 
that, at least from the beginning of this century, the possibility that 
the Sanskrit gerunds might be a calque of the Dravidian indeclinables 
has often been considered.54 However, as was expressly stressed 
some twenty years ago, this necessarily implies that the adaptation of 
the Vedic sentence pattern to that of Dravidian must date from a 
pre-Vedic, that is prehistoric, period. As stated above, several 
scholars have since expressed their disbelief *5 


The parallelism with the case of iti is, it would seem, obvious. 
In Dravidian, which has no subordinate clauses in the current sense 
of the word, a sentence structure without gerunds is inconceivable. 
In Munda the corresponding forms in the northern languages differ 
from those in South Munda: cf, Santali sen-kate “‘having gone’, 
Mundari hiju-akan-te “having come’, Ho agu-ked-te “having 
brought’, Korku hadir-en-ten “having arrived’’, as opposed to Kharia 
col-kon “having one’’, Sora jum-le “‘having eaten’, Gorum sunu-du 
“having fallen’. This, together with the fact that cognate languages 
outside the Indian area (like Khasi) do not have them, proves the 
Munda forms to be innovations. 


Again, as in the case of iti, modern Indo-Aryan languages have 
independently created new formations which have taken the place of 
the older gerunds of Sanskrit: Hindi j4-kar, Gujarati avine “having 


54 Ci. Sten Konow, Indian Antiquary, 32 (1903), p. 455, Linguistic Survey of India, 
4 (1906), p. 280, J. Bloch, La structure grammaticale des langues dravidiennes 
(1946), pp. 67, 100 (but cf. BSOS, 5 [1925], p. 733f.), Kuiper, India Antigu (1947), 
p. 211f., P. Meile, L’Inde Classique 1 (1947), p. 119. A different view was taken 
by Pavel Poucha, Archiv Orient4ini, 17/2 (1947), p. 292, who considered this a 
structural affinity of a larger group of Asiatic languages. Cf. Emeneau, Lg., 32 
(1956), p. 9. 


55 Sze Renou, Histoire de la Inngue sanskrite (1956), p. 29 n. 1. 


ey eh ager “) 
Was) 4 OW Be ne 
Heo eed eae 
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come’? are used in quite the same Way as their Tamil equivalent 
ya-ntu. §o the reality of a convergent development cannot be ques- 
tioned : the prehistoric creation of the Indo-Aryan gerunds was its 
first manifestation. Incidentally, their antiquity is also apparent 
from the fact that, when they were created, laryngeals must stil have 
been pronounced in word-final position. 


We can only account for the total mass of facts by assuming 
that long before the oldest hymns were composed the use of gerunds 
in proto-Indo Aryan arose among bilinguals, presumably in collo- 
quial speech. The poets continued to use the traditional perfect 
participle until, in the last period of Rigvedic poetry, when the 
influence of other social classes became stronger, the new formation 
was fully accepted even in poetry. 


8. Summing up, I may state in the oldest text of Indo-Aryan, 
dating from some three thousands years ago, three unpredictable 
innovations were found to occur, viz. a new set of phonemes and two 
new traits of sentence structure, one connected with a new gramma-— 
tical category (the gerund), and the second with a completely new 
use made of the inherited word iti.56 In two of these cases the 
tendency for convergence could be shown to be a persistent factor in 
the Indian linguistic area, as the Sanskrit innovations were at a 
much later time again replaced by new formations of modern Indo- 
Aryan languages. 


The cumulative evidence of the three pre-Vedic innovations 
would seem to leave little doubt as to the role of Dravidian in the 
Indian subcontinent. If this evidence should prove strong enough to 
put an end to a dispute which started more than a hundred years ago, 
the effort made to demonstrate this thesis has not been in vain. 


The implications of this evidence are particularly interesting 
because the picture of proto-Indo-Aryan that emerges from this 
analysis differs considerably from the concept of a former generation. 
A language in which simultaneously Dravidian calques arose and 
Indo-European laryngeals were still pronounced was more progressive 
and, at the same time, more archaic than could be imagined a few 
decades ago. There is, besides, this historical implication that the 
period between the arrival of the Indo-Aryans in the Indian subcon- 
tinent and the composition of the oldest Vedic hymns must have been 


56 In this connection Holmer’s conclusion that phonology and syntax are as a rule 
first affected by foreign influence might be of interest. See Lund Arsbok 
1961/1962 ufgiven av seminarierna for slaviska sprak...(1965), p.49. 
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much longer than was previously thought. Although every attempt 
at a chronological fixation is mere guess-work, it would seem unsafe 
to date any part of the Rigveda earlier than c. 1400 B.C. 


When compared to Old Iranian, Indo-Aryan seems, even in 
prehistoric times, to have been open to foreign influences, while the 
development of Old Iranian was more largely determined by 
tendencies inherent in its own system. However, in the case of 
Iranian no comparison with foreign languages of the same area is 
possible, and the internal evidence alone, acquired by the study of 
an isolated linguistic system, is no sufficient basis for definitive 
conclusions. In Indo-Aryan it was not the unpredictability as such 
of the innovations but the wider Pan-Indian perspective that 
furnised the criteria for tracing foreign influences. To the methodo- 


logical problem that different linguistic observers may view the 
same data ina different perspective the only solution is that the 
conclusions derived from a more comprehensive view must prevail 
over those based upon a narrower perspective. 


In this respect the changes that took place in proto-Indo- 
Aryan, more than three thousand years ago, may have some importa- 
nce for the metodology of convergence studies in general. Inso far 
as these changes may be considered the result of a subtle interplay 
between internal factors inherent in the system and _ external 
factors of foreign influence they may contribute to our insight into 
the ever-fascinating problem of change in language. 


International Journal of Dravidian Linguistics 


THE BASIC ORDER TYPOLOGY AS A DEFINER 
OF AN INDIAN LINGUISTIC AREA 


Colin P. Masica 


(1) The hypothesis 


We have been made aware of the existence of linguistic con- 
vergence phenomena in the Indian region, and a number of specific 
cases of cross-genetic correspondence have been adduced. Some of 
these affect only limited areas, for example Telugu and Marathi, or 
Marathi and parts of the Kannada speech area. Others, for example 
the use of counter words (noun classifiers) tie certain SA languages 
to languages outside the subcontinent. 


It might be in order at this point to step back and view the 
whole picture. Do the characteristics which are shared by all or 
most Indian languages outweigh those by which some of them are 
variously linked to languages of contiguous regions? In other words> 
beyond particular examples of local convergence, and beyond a not 
unexpected sharing of features between links in a chain of contiguous 
languages of which those of India are but a part, does any overriding 
phenomenon we might call an ‘‘Indian linguistic area’’ exist? If so, 
on what basis— in terms of what criteria? If such an entity does not 
correspond exactly to the present political boundaries of India, or 
India+ Pakistan, where are its boundaries? What is its relation 
to history? 


It would be especially desirable if the criteria causing such 
an entity to emerge were to constitute in some sense a demarcatable 
set themselves rather than a mere accumulation of unrelated features, 
the relative importance of which vis-a-vis similar features which do 
not so pattern is difficult to measure. (We may of course take it 
that the mere fact of patterning areally in this way implies or 
constitutes relationship among features.) 


One such readily demarcatable set of criteria we might try 
to apply is suggested by Greenberg’s basic order typology 
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(Greenberg 1966). When one looks at Greenberg’s article, one is 
struck by the extent to which especially his word-order data (he also 
deals with a number of other matters) seem to cluster geographically 
as well as implicationally. 


Greenberg’s purpose, of course, was quite different: to 
ascertain, through examination of a world-wide sample of represen- 
tative languages, what might be presumed to be true of language 
universally-either in terms of flat occurrence/non-occurrence ot of 
implication of a particular grammatical trait in a language given the 
presence of another. Areal configurations which begin to emerge 
even on this basis fairly beg to be filled out and tested with other 
relevant languages. 


The tendency for traits to correlate, demonstrated by Green- 
berg, and the fact that all of the logically possible arrangements 
and combinations do not occur, or do so very rarely, should make 
the outlines of syntactic regions, if they do exist, fairly clear. At 
the same time, the fact that most of Greenberg’s implicational 
generalizations are neither one-way of nor absolute (i.e. rather just 
tendencies) makes it necessary to investigate and map each trait in 
question separately. 


For example, it may be true that all languages with dominant 
order Verb-Subject-Object are prepositional (Univeral No. 3), but 
the converse is not true-prepositional languages may have other 
orders as well. Similarly, the existence of exceptions to the rule 
(‘‘Universal No. 4’’) that ‘‘almost all’? Subject-Object-Verb languages 
are postpositional-and the impingement of these exceptions, more- 
over, precisely on our area of interest-makes it necessary to treat 
these traits as independent variables. 


These discrepancies actually make map- making a more 
meaningful exercise. If a number of traits tend to correlate but not 
absolutely, by drawing a separate isogloss for each we can expect 
a bunching of isoglosses at some points and a spreading at others, 
the steepness of gradient becoming indicative of the sharpness of 
the typological boundary. (In practice, we shall adopt certain 
shortcuts here.) In most sorts of feature mapping, one expects to 
find such zones of overlap and transition. 


Recent work on word order and deep structure by Ross 
(1970). Bach (MS.)! and others makes it necessary for us to make 
it very clear here that itis surface structure that we are concerned 
with. It is surface order which patterns areally. It appears from 
the above studies that deep order reflects genetic stock: in other 
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words, at this level any convergence patterns dissolve —at least as 
far as order features are concerned. While it is an insight of key 
importance for the history and process of formation of linguistic 
areas that intrusive languages appear to conform themselves to pre- 
vailing local patterns by means of late transformations (and 
presumably might thereby be identified, even in the absence of other 
data, as intrusive languages in contrast-barring a universal base 
structure-with languages in the same region for which such trans- 
formations are not required), we also must declare that we do not 
address ourselves to these problems here, or to the historical dimen- 
sion in general. For purposes of this paper, we will confine ourselves 
to testing the hypothesis that synchronic surface order features of 
themselves constitute a delimitable set of criteria useful in defining 
linguistic areas in general and and Indian linguistic area in particular. 


The particular features to be examined are largely those men- 
tioned by Greenberg (1966), with some additions and modifieations. 
Their choice is not formally motivated by anything in the grammar, 
except unconsciously. Features produced by transformations and 
features found in the base component are not distinguished at the 
outset, although some interesting differences may be observed 
(depending on the current state of theoretical relegation of a feature 
to one or the other component.) Adjectives are particularly note- 
worthy here. In general, the features examined are the more salient 
and readily ascertainable ones; a brief check has shown that other 
features éssentially just confirm the picture revealed. 


(2) Theoretical and practical difficulties. 


This is not to say that there are no problems, doing even 
this much. As Greenberg has noted, ‘‘the higher the construction 
in an immediate constituent hierarchy, the freer the order of the 
constituent elements.’’ (1966) In stratificational terms, this means 
that sentence elements such as nominal, verbal, and adverbial phrases 
admit of more alternative arrangements among themselves than do 
the constituent words of the phrases, and these in turn more than 
do the generally fixed morpheme-constituents of words. 


Unfortunately, we are especially interested in the arrangements 
of higher constituents, such as Subject, Verb, and Object, precisely 
because their greater ‘flexibility’, as it is usually called, would seem 
to make them more susceptible to areal influences. Lower-order 
constituents become progressively less so, barring profound restru- 
cturing (‘‘pidginization’’?), as we strike closer and closer to the core 
of the often remarkable persistent genetic heritage. An example is 
Brahui?, which at higher level of syntax, such as those governing 
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the embedding and conjunction of sentences, has abandoned the 
‘‘Dravidian’’ mode of expression (participles) for the ‘Iranian’’ 
(conjunctions), but retains a recognizably Dravidian morphology, 
including a negative conjugation, a causative formation with a labial 
(-ef-, -if-, cf. Tamil -vi-, -pi-), etc. The case of Romany (Gipsy) 
is accordingly more extreme, for it has adopted not only the 
dominant SVO order of Eastern Europe but also a system of verbal 
prefixes analogous to those of Slavic (Kochanowski 1963). Although 
such cases may be more frequent than we realize, it remains a fact 
that we are more likely to find borrowed or shared features at the 
higher levels of syntax. 


It is here also, to repeat, that basic descriptive problems present 
themselves. Greenberg says that ‘‘the vast majority of languages 
have several variant [clause-element] orders but a single dominant 
one’, but offers us no clue as to how he isolated the latter. Several 
different situations could conceivably present themselves: 


1. orders other than order X are impossible 

2. orders X, Y, Z...occur, but orders Y, Z...are derivative 
or specifically conditioned 

a. structurally: e. g., verb position in German subordinate 
clauses. 

b. lexically: e.g., certain French adjectives-a finite list 
(bon, mauvais, grand, petit, etc.)-normally precede their 
noun 

c. semantically : here we include 

1. displacement for emphasis (or de-emphasis), common in 
many languages but not equally possible in all 


2. displacement for semantic and/or stylistic effect (again 
cf. French adjectives) 
Accordingly, order X emerges as the unmarked or ‘basic 
order. 
3. orders X, Y, Z...all occur and are apparently uncondi- 
tioned, but order X is more frequent 
4. orders X, Y, Z...occur with equal frequency (or nearly so). 
Obviously, there is no problem in Situation 1, and no solution in 
Situation 4-except by invoking transformational grammar, whose 
aid we have already rejected because of the havoc it would wreak 
in our convergence picture. Fortunately, Situation 4 does not 
seem to be too common-but, as will be seen, the particular situation 
that occurs is itself of areal significance. To anticipate a little, a 
language which cannot make up its mind is typically found 
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geographically suspended between two Opposing camps, and we 
shall not want to obscure the fact of its ambivalence by forcing it 
into one or another mold through any mere clever trick. 


The frequency criterion of Situation 3 naturally often 
combines with other criteria: displacement for emphasis or other 
purposes obviously depends for its effect on its being a violation 
of afrequency norm; conversely, it is difficult for an infrequent 
order to avoid producing some stylistic effect by the mere fact of 
its infrequency. Structurally or lexically conditioned orders are 
probably less frequent than their unconditioned counterparts. 


It is clear that Greenberg underestimated the scope of this 
problem for Dravidian, Indo-Aryan, and similar languages. He 
assigned them to a special subtype of SOV languages characterized 
by great rigidity of clause structure: only OSV as an altenative 
order, nothing except question and a few other particles permitted 
after V, which always ends the sentence-in other words, close to 
Situation 1. Thisis simply not so, for Tamil, for Hindi or Bengali, 
or even for Turkish (as he acknowledged for the last in an appended 
note). All permit displacement for emphasis or stylistic effect, not 
only to OSV, but also to SVO, and perhaps most commonly, OVS. 
Adverbials can occasionally follow the verb. Characterization of 
Indian languages as SOV-types will accordingly have to rest heavily 
on frequency? and on such matters as restricted order in subordinated 
clauses. 4 


Like Greenberg, we shun the burden of proof in every case 
on the plea that there is too much else todo. In what follows, the 
reader will have to assume that there are good grounds for taking 
word order to be as stated fora given language, and that this is in 
most cases the prevailing descriptive opinion (pre-transformational, 
that is.) We have rechecked Greenberg’a data, which are subject 
to dispute or inconsistent at some points. It is inevitable, when 
attempting to cover so vast a ground, that some missteps will occur. 
Wecan only hope that ours will be asclear and vulnerable to 
correction by concerned specialists as Greenberg’s. 


(3) The Indian syntactic norm 


The problem for investigation presents itself in two parts. 
The first is to establish that there is substantial agreement in word 
order among languages of the subcontinent belonging to different 
genetic stocks, and to note in what that agreement consists. The 
second is to discover the areal extent of those traits, in terms of 
isoglosses and the configurations they define-if indeed they can be 
drawn. That also remains to be seen. 
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We might approach the first question with the abstract pre- 
mise that Indian languages are, for example, “left-branching’’, and 
measure them and others against that yardstick. However, this 
would either assume in advance that Indianization is equivalent to 
Dravidianization, or shift the focus of our concern from areal to 
general typology. Weshall adopt rather the empiric approach of 
selecting representative languages from each of the three allegedly 
converging stocks in India-Dravidian, Indo-Aryan, and Munda- 
and ascertaining which order traits are common to them, 


For this purpose we have chosen (Hindi-Urdu (H), Bengali 
(B), Telugu (T), Malayalam (M), and Santali (S), representing 
respectively western and eastern Indo-Aryan, central] and southern 
Dravidian, and Munda* Features of surface syntax shared by 
these five languages include the following: 


(A) (S)OV - the important thing here is the last part, Object 
precedes Verb, as contrasted with Greenberg’s other two main types, 
SVO and VSN, in which the Verb precedes the Object. As we have 
implied above, this isa norm, not an absolutely fixed order, in 
Indian languages. 

Examples (1): 

raaman hindii boltaahai- ‘Raman speaks Hindi’ (H) duusraa up- 
aay sociitigaa- ‘I will think up another way out (H) se EkTa gaRi 
kineche- ‘He has bought a car’ (B) ami tar nam jadi na- ‘I don’t 
know his name’ (B) kamala puulu koostunnadi- ‘Kamala is 
plucking flowers’ (T) annam tinaNDi- ‘Please eat the food’ (T) 
avasaanam maanine kaNTupiTiccu-‘Finally he spied the deer (M) 
paNi veegam tiirkku- ‘Complete the work quickly’ uni hoR-.ko 
tOlkedea— ‘They bound that man’ (S) ac’ren golam-e kolkedea- 
‘He sent his servant’ (S) 


Counter-examples may be found, generally under special conditions 
of emphasis. 


Examples (2): 
kar cuke duusraa up1zay-‘You’ve already exhausted other way (s)’ 
(H) aydunel alanunci aNTunnaaw padirodzulu padirodzulani- 
‘For five months you’ve been saying ‘“‘ten days, ten days” ‘*(T) 
tum choR do abkii se khetii “You give up farming right now! (H) 


A speaker of both Hindi-Urdu and Telugu feels that Hindi 
lends itself to this more than Telugu does. Dravidian languages 
have other means of conveying emphasis. 
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So strong is the force of this norm even in Hindi-Urdu. 
however, that it may be used as one criterion for setting up Agentive 
and Dative “Subjects” as special categories in various Indian 
languages. 
Examples (3): 
kutte ne piitich hilaaii - ‘The dog wagged his tail’ (H) 
mujhe ek ciTThii likhnii hai-‘I have to write a letter’ (H) 
enikku raamane arinnilla-‘I didn’t know Raman’ (M) 
aameku amerikaa tsuuDaalani undi-‘She wants to see 
America’ (T) 
Our examples so far have been confined to noun objects, but 
pronoun objects behave no differently. Santali might appear to 
present certain problems because of object-incorporation in the verb: 


Examples (4): 
in agu-ko-a-‘I shall-bring-them’ 
ba-ko sap’-led-e-a-‘They did-not-catch-him’ 
gidra-ko benao-ked-in-a -‘They treated—-me as a child’ 


This might at first appear to represent a pronoun object 
structure VO. (The ‘adhesion’ is no bar: our material contain 
other examples of suffixed pronoun objects.) However, this same 
structure occurs when nominal or separate pronominal objects also 
occur : 


Examples (5): 
mErOm-e gOc!’ -ked-e-tii-a- ‘He killed my goat’ (Goat-he 
killed-it my) ENDEkhan in hO cet ‘a-i-me- ‘Then teach me too 
(Then-me-too-teach- me-thou) 


These ‘“‘infixes” are (in aJl cases except the 3rd person singular) 
recognizably the same morphemes that constitute the separable 
pronouns, Any notion that personal endings have to be different 
than the pronouns, however, is an Indo-European prejudice, and 
we may call this phenomenon accordingly mere object-indication 
in the verb inflection. As the separate pronoun subject is ordinarily 
not used except for emphasis in languages where it is already 
sufficiently indicated by the personal inflection of the verb, so also 
in Santali the separate pronoun object is ordinarily not used, being 
already sufficiently shown by the verb, as in Examples (4). 


For Indian languages and many others as well, the term 
OBJECT might be broadened to VERBAL COMPLEMENT, for 
goals of verbs of motion, predicate nouns and adjectives, so-called 
adverbial complements, infinitival complements of catenatives, and 


“ae 
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certain other items tend to occupy the same position as the Direct 
Object, whether in SOV or SVO lanuages. Compare the English 
translations of the examples below. 


Examples (6): 


sab log apne apne ghar jaao-‘Everybody go home’ ami somdeber 
baRite jacchi- ‘I’m going to Somdev’s house’ (B) iNTiki 
weLdaam - ‘Let’s go home’ (T) kaakka maaninteyaTutta 
parannu cennu-‘The crow flew to the deer’ (M) onko oRak’'te-ko 
sEn akana—‘They have gone home’ (S) ham sab muurakh hai- 
‘We are all fools’ (H) tini Ekjon DakTar chilen-‘He was a doctor’ 
(B) ciire nallagaa undi- ‘The saree is black’ (T) caaya tayyaar- 
aayi- ‘Teais ready’ (M) rakSappeTaan nookkukayaayirunnu- 
‘He was trying to escape’ (M) noakathadO sAri kangea- ‘This 
story is true’ (S) ar ADi OkOc'ak’ko cet'ako-e pOrtOnket’a- 
‘And he began to teach them many things’(S) 


The order is the same, only more rigid, in subordinate clauses, 
and the same in interrogative as in declarative sentences, except in 
Malayalam, where there is a tendency to put the question-word first. 


Examples (7): 


aar-aaN-avaL? - ‘Who is she?’ (M) 
ent-aa nii nirutti - kkaLafifiata? - ‘Why did you stop?’ (M) 


This is merely a stylistic tendency, dependent moreover ona 
peculiar use of the quasi-copular element / aa (N)/, and pronominali- 
zation of the verbif there is one. The pan-Indian order, putting 
the question word in place of the word used to answer it in a 
declarative sentence, is also possible. 


Examples (8): 


nii-y-ennine ivile? — ‘How did you come here?’ (M) 
ninnaL-aar-aaNa? - ‘Who are you?’ (M) 


(B) V+Aux — the definition of “Auxiliary”? needs some attent- 
ion here. Let us be clear class that we do not mean by Aux what it has 
meant in transformational descriptions of English. Greenberg’s 
definition (‘‘a closs of verbs inflected for both person and number.... 
in construction with an open class of verbs not inflected for person 
and number’’) 1966 makes strange demands unsuited to Indian 
languages, where the main (openclass) verb may be inflected for 
number (H), where no verbs may be inflected for number (B), 
where auxiliaries in certain tenses may not be inflected for person (H), 
or where no verbs may be inflected for either person or number (M). 
We fail to get the point. 
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Although a simple open vs. closed criterion would serve well 
enough for Indian languages, it would lead to too many complications 
in the languages we are going to compare them with— interesting 
complications, but best deferred from present consideration. For 
the purposes of this paper, let us define Aux as a tense-marking 
word or quasi-word in complex verb forms. In our Indian languages 
sample, tense-marking auxiliaries invariably follow the main verb, 
and close the construction. 


Examples (8): 


aa rahaa hai — ‘He is coming’ (H) aa rahaa thaa—‘He was coming’ 
(H) aayaa hai- ‘He has come’ (H) aaegaa-—‘He will come’ (H) 
phire eseche -‘He has returned’ (B) jacchi- ‘I am going’ (B) 
jabo- ‘I shall go’ (B) tsadawaleedu- ‘I/he/you/we/etc. did not 
read’ (T) tsadiwi unTini - ‘I had read’ (T-lit.) parann-irikunnu- 
‘«...has flown’ (M) parann-irunnu- ‘.. had flown’ (M) varunnu- 
NTaayirunnu-‘...was coming’ (M) calak’ kana-‘.../is/are going’(S ) 
gOc’ akadea —‘.. have killed’ (S) 


Colloquial Telugu (of the Coastal Districts) presents certain 
difficulties for our adopted working definition in that the common 
auxiliary annaanu, unnaaw, ete. has submerged the distinction 
between Present and Past: tsaduwutunnaanu= ‘J am/was reading’. 


(C) PO-Our Indian languages are postpositional rather than 
prepositional. They moreover generally possess two kinds of post- 
positions, a small set of short particles resembling suffixes (or as older 
descriptions would have it, ‘‘case-endings’’), and more numerous 
longer forms, derived from nouns or verbs, added to genitive or 
dative case forms of the noun or pronoun. 


Examples (9): 


ghar me’—‘in the house, at home’ (H) ghar ke piicha - ‘behind the 
house’ (H) baRite- ‘at home’ (B) boner mOddhe- ‘in the middle 
of the forest? (B) gumpulo- ‘in the crowd’ (T) Dabbu koosam- 
‘for money’ (T) muriyi/—‘in the room’ (M) aa maraftinre kiizhe- 
‘beneath that tree’ (M) oRak’re- ‘in the house, at home’ (S) 
bir sEnre-‘towards the forest’ (S) 


(D) Adj +N, Gen + N, Dem + N, Num + N- In the Indian 
languages in question, adjectivals5, genitive expressions (defined 
either specifically in terms of a relationship of “possession”, or 
merely in terms of the subordination of one NP to another by the 
same means used to indicate possession) or by another means, 
demonstratives, and numerals all precede the nouns they modify. 
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Examples (10): 


andher ii raat—‘dark night’ (H) (undor chele - ‘Beautiful child’ (B) 
pedda samasya~‘big problem’ (T) nalla hooTTal-‘good hotel’ (M) 
mit’ Thuikut’ saham-‘a small piece-of-firewood (s) ruam kaa 
bhaaii-‘Ram’s brother’ (H; puttiy6 kaa ek Dher- a pile of leaves 
(H) somdeber baRi - Somdev’s house (S) boner raja — the king 
of the furest (B) keantaaraan koopam - kantarao’s anger (T) 
aandhradeesa caritra- ‘the history of Andhra’ (T) kaakkayuTe- 
muTTa_—‘the crow’s egg’ (M) ente muriyuTe taakkool - ‘the key 
of my room’ (M) Johan babaptisic’ reak' bOhOk! - John the 
Baptist’s head’ (S) bir reak' dare - ‘a tree of the jungle (jungle 
tree)’ (S) vah aadmii-‘that man’ (H) yah aurat-‘this woman’ (H) 
ei lokTi- this man’ (B) ii niiLLu - ‘this water’ (T) ii aama - ‘this 
turtle’ (M) nui hOR-‘this man’ (S) do ghaNTe - ‘two hours’ (H) 
pacTa kukur—/five dogs’ (B) muuDu nelalu-three months (T) 
naalpatu kuzhi - ‘forty holes’ (M) bar poeSa-‘two pice’ (S) 


It will be noted that Telugu marks the genitive relationship 
by positson only (in the case of nouns). 


(E) Qualif + Adj-qualifiers of adjectivals precede them. 

This includes not only ‘‘intensifiers’’- words for very, somewhat, 
etc. but also qualifizrs of comparison-the equivalents of ‘‘than X’’, 
which also may be viewed as qualifying an adjectival: how big? 
Very big. Bigger than an albatross. Following Greenberg (1966), 
the criterion is the position of the markere (M) of comparison 
(= ‘‘than’’) and the standard (S) of comparison (=‘‘X’’) relative to 
the adjectival (A): equivalents, if any of “‘er’’, ‘“‘more’’, etc. are 
ignored. Indian languages have the comparison structure SMA, 
the modifying elements SM preceding the modified A, rather than 
following as in English (AMS). 
Examples (11): 

bahut acchaa ‘very good’ (H) 

usse acchaa-‘better than that’ (H) 

khub gOrom—‘very hot’ (B) 

gOtokaler cee besi gOrom-‘hotter than yesterday’ (B) 

tsaalaa ettugaa (uni) - ‘it is yery high’ (T) 

kamala kaNTe andangaa (uNTundi)-—‘(she is) more beautiful than 

Kamala’ (T) 

kiTTu valiyo dhanavaan-aayi-Kittu became very rich’ (M) 

haliimaa ninre mumtaasine kkaa\ etrayoo uyarnnavalaaNa- 


‘Haleema is much greater than your Mumtaz’ (M) 
ADimarah sadOm-‘a very big horse’ (S) 
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inren khOne marana-‘(it is) bigger than mine’ (S) 
Santali, significantly, also permits the order Adj+Qualif: maran 
OkOc' hati-‘a very large elephant’ (S) 


All the features so far described have been instances of Jeft- 
branching - the modifier precedes the modified. However, we cannot 
simply say that Indian languages as such are left-branching. Only 
Dravidian languages-and not all of them-are consistently and 
exclusively left-branching. The crux of the matter in the complex 
sentence. Although conditional, temporal, and other adverbial 
clauses mormally precede the main clauses to which they are sub- 
ordinated in all the languages, only Dravidian (that is, in our sample, 
Telugu and Malayalam) forces nominal object clauses to precede 
their verb and adjectival (‘relative’) clauses to precede the noun they 
modify : in Hindi-Urdu. Bengali, and Santali they normally follow. 
The typical Indo-Aryan device of relative marker preceding the noun, 
with the main body of the clause (including the verb-Greenberg’s 
criterion) following the noun but preceding a correlative in the main 
clause, may be considered a compromise as far as relative clauses 
are concerned. The noun and the correlative pronoun as it were 
trade places, so that the clause does precede the noun’s proxy, 
the correlative. 


Examples (12): 
‘the very stone which the builders rejected’ 
H: jis patthar ko / raajmistriiyé ne / nikamma Thaharaayaa/vahii 


which stone _ the builders rejected IT 
is — / illu kaTTuvaaru / niraakarincina / raay 
(by) the builders rejected STONE 
M: — / viiTu paNiyunnavar/aakaa ennu taLLiya/kallu tanne 
(by) the builders rejected stone-itself 


The H verb, of course, is not adjectival here. Moreover, a form 


without a correlative, and with the marker following the noun, seems 
to be equally common: 


yah anjiir kaa peR jise tuu ne sraap diyaa thaa svukh gayaa 
— ‘the fig tree which you cursed has withered’ 
Despite the awkwardness of the Aryan construction when compared 
with the Dravidian one, it is often imitated by the Dravidian 
languages themselves (if that is not begging the question.) 
Examples (13): 


eemi dzarigindoo adi dzarigipooyindi - ‘What has happened, has 
happened (T) (f. H jo huaa so huaa) 
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ii Dabbu ekkaDanunci vaccindoo, akkaDikee pampu — ‘Send 
this money back were it came from’ (T.) Cf. H. ye paise 
jahaa se aaee, vahii bhej do; B. jar kach theke eégeche take 
phirie dao 


Participial constructions subordinating a VP to an NP are possible 
in Hindi-Urdu, especially when the VP involves no complements 
or other extensions, more readily then in English, but the point is, 
they are not compulsory-and even then need not always precede 
the noun. 


Examples (14): 
giraa huaa bairii-‘a fallen enemy’ 
rota ahuaa baccaa—‘the crying child’ ‘the child who is crying’ 
oeR par coRha2z huaa aadmii-‘the man who had climbed up the 
tree’ 
zamiin paor leTaa huaa aadmii-‘the man who was lying on the 
ground’ 
bace hua log-‘the people who had been saved (will have) 
rakkhaa huaa khaanaa~‘the food which was kept’ 


Statistical measurements of the frequency of such constructions 
vs. equivalent relative constructions in texts of various languages 
under consideration here (including English, German and Russian) 
would no doubt be of interest, but let us adopt as acrude taxonomic 
principle here the behavior of VP’s with complements when sub- 
ordinated to a noun. On that basis, our five-language sample 
exhibits no common structure, only tendencies in that direction. 


That is not to say that the five headings above exhaust the 
points of agreement in syntax among our representive languages. 
Others seem to include, for example, adverbials-both their position 
in the sentence and relative to one another. These are matters 
that are merely too complex to be gone into here. 


(4) The areal configuration 

We now come to the second and most important stage in our 
investigation-to determine whether the syntactic points on which 
our representative languages agree are indeed characteristically 
‘Indian’? and useful in establishing, in terms of isoglosses, an 
‘Indian (on SA) linguistic area’’. We shall do this by attempting 
to trace each trait established above outward in all directions until 
we meet resistance in terms of an opposing trait; there an isogloss 
is drawn. This is putting it oversimply, of course. Our “‘isoglosses’”’ 
will have to be for the most part highly generalized on the basis of 
whole languages in their standard form, rather than on dialectological 
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or reporting-station information. This will serve the purpose of 
basic blocking out necessary for our thesis. Further refinements 
become meaningful only if that is proved. The problem posed by 
mixed or conflicting traits in a given language will be dealt with by 
adopting the convention of drawing the isogloss in question through 
the middle of the speech-territory concerned, without implying that 
there is a boundary at precisely that point geographically. 


We had better go back a step further and determine whether 
the traits in question are so distributed as to permit the drawing of 
isoglosses : a mere irregular concentration is conceivable. However, 
this does not seem to be the case. Languages which are exceptions 
to an areal pattern are rare enough to be merely recorded as excep- 
tions, although always posing intriguing historical questions. The 
patterns stand. 


This will only become clear to the reader as we proceed to 
trace the distribution of the traits. 


(A) OV (or, except where noted, Complement + Verb) - this 
basic pattern includes not only the remaining Drav. and IA languages 
of the subcontinent, but also the Tibeto-Burman, Iranian, and 
Dardic languages of its borderlands (including Tibetan, Burmese, 
Pashto, and Persian)- with the important exception of 
Kashmiri. It also includes Burushaski and the Tai-affiliated Khamti 
language of Assam, and extends to Armenian, Kurdish, Ossetic, the 
Turkic and other Altaic languages, Korean and Japanesce, the Hyper- 
borean languages, and Eskimo. The OV area appears to continue 
even further, into northern North America, but we shall not pursue 
this here. In another direction, it reaches out to take in both the 
Hamitic and the Semitic languages of Ethiopia and Somaliland. 
It appears to include the Andamanese languages, and according to 
Greenberg’s data, most of aboriginal Australia. 


Beyond this massive bloc, OV languages stands as_ isolated 
exceptions in equally massive blocs of VO languages-Basque in 
Europe, Lolo in southern China, Hottentot and a few other langu- 
ages in Africa. Arabic and Berber, Greek, Romance, Celtic, 
Germanic, Balto-Slavic, most African languages, even Georgian and 
Finnish are part of a western VO bloc. Chinese, Thai, Mon-Khmer 
languages, Vietnamese, and Indonesian languages (including Mala- 
gasy) are part of an eastern one. 


The latter bloc reaches in, through the Palaung languages of 
northern Burma, to subtract the Khasi language of Assam from the 
SA area. Interesting qualifications also must attend inclusion of 
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the sothern Munda language Sora. Free lexemes in this language 
order themselves (S) OV, but in compound forms, which are very 
numerous, the ancestral order VO is preserved.é 


The Indo-Aryan Sinhalese also presents special problems. 
As we have seeen, most SA languages permit some violation of the 
OV rule under special conditions of emphasis. In Sinhalese, this 
is formally signalled by a special form (-nne) of the verb, in the 
presence of which optional but commonly invoked rules of order 
change make VO patterns quite frequent. 


Examples (15): 


mahattea suruTTu bondwa-—‘The gentleman smokes cigars’ (Sinh) 
mahattea paawicci kAranne mee suruTTu — ‘The gentleman uses 
these cigars’ (Sinh.) 


mee baseka kalutArATA yanAwa-‘This bus goes to Kalutara 
(Sinh:) 

mee basekA yanne kalutAr ATA-‘ditto, or It is to Kalutara that.’ 
(Sinh.) 

(see Fairbanks et al. 1968:7,43,et passim.) 


Other kinds of attrition are to be seen in the west. In 
Ethiopia and in Iran and its borderlands pronominal objects may be 
suffixed to the verb, thus presenting a structure Vo: 


Examples (16): 
didam-esh ‘I saw him’ (Persian) 
man aa-girsan-ish-‘I will seize them’ (Balochi) 
gaddalhu aachaw-‘I killed them’ (Amharic) 


Although separable pronoun forms also occur in the normal position 
(So V), generally with emphatic value in speech, these enclitic forms 
appear to demolish Greenberg’s Universal No. 25 (‘If the pronominal 
object follows the verb, so does the nominal object.’’). Bach, ina 
recent paper devoted to Ambharic, notes that such exceptions to 
Greenberg’s universals occur where the deep structure order differs 
from the surface order, which in turn occurs where the typical order 
of the genetic stock to which a language belongs differs from that 
of the area in which it now finds itself. Wemay add that they 
also occur near the margins of major syntactic zones geographically. 


Across the OV/VO boundary it is interesting that the reverse 
of this pronoun phenomenon occurs, namely tbe enclitic pronoun 
object stucture oV in the VO languages Greek, Rumanian, Bulgeari- 
an, Italian, Spanish, French, Swahili, etc. 
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The broader category Complement + Verb also breaks down 
in the western borderlands. In colloquial Persian, Turkish, and 
Kurdish, the order norm for goals of motion verbs, which is VG 
does not follow that set by direct objects of transitive verbs, OV. 


Examples (17) 


boro manzel! — ‘Go home!’ (Persian) 
bé béin bd bazar — ‘Let’s go to the market’ (Kurdish) 
baatim-aa shaarii— ‘1 came to town’ (Kurdish) 


(B) The distribution of V + Aux broadly coincides with 
that of OV. Again Kasheiri is an exception in the SA context, with 
Aux + V in harmony with its VO. Again there is vacillation on the 
west, in Iranian. The Persian perfect auxiliaries follow, but that of 
the future (not used in colloquial Tajik) precedes (rafte ast— ‘has 
gone’, rafte bud-—‘had gone’, but xaahad raft - ‘will go’). In 
Fashuo tense-forming ‘‘particles’? (added to finite verbs) precede, 
but true auxiliaries (added to participial forms) follow : 


Examples (18) 


zA bA xat likAm- ‘I will be writing a letter’ (Pashto) 
maa xat likAlay vu -‘I had written a letter’ (Pashto) 


According to Penzl’s analysis, the particles are essentially 
modal in function. However, he cites an order transformation 
with the negative that brings the auxiliary itself before the verb: 


Examples (19): 


staa axbaar raagAlay day - ‘Your newspaper has come’ (Pashto) 
staa axbaar neday raagAlay - ‘Your newspaper hasn’t come 
(Pashto) 
Again, there are analogous phenomena in reverse across the 
VO boundary. In Bulgarian, Serbo-Croat, Slovak, Czach, Polish, 
and Ukrainian, certain auxiliaries (generally those of Perfect = Pasts, 
and Conditionals) normally follow, but precede under special condi- 
tions, while others (typically Futures the pattern varies slightly 
from language to language) normally precede but may follow under 
special conditions, as may certain Rumanian auxiliaries. This 
brings us finally to the German phenomenon of inversion of auxiliary 
and Main verb in subordinate clauses. 


(c) The distribution of Po follows that of OV, with certain 
noteworthy discrepancies, again found in the border regions. Persian 
and Tajik have exclusively prepositions (or possibly one postposition, 
if the accusative particle -raa is so classified), Closer in, Balochi ha, 
appropriately a mixture of the two. Pashto along with wakhis 
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Ishkashmi, and other Pamir languages has the same, plus a develop- 
ment which points up a certain inadequacy in the Po/Pr dichotomy- 
namely, both simultaneously, that is, discontinuously. We might, 
following Greenberg’s “ambifixes”’, call them “ambipositions’’: 


Examples (20) 


da kor tsaxa -‘from the house’ (Pashto) 
pa kor ki-‘in the house’ (Pashto) 
ta mez ladndi—‘under the table’ (Pashto) 


Kashmiri is postpositional, while Hindi-Urdu itself has at least one 
preposition, which also appears as ambiposition and postposition, 
namely the item binaa, ‘without’. 


To the south, at the Abyssinian end of the western frontier, 
Tigré has prepositions; Amharic, Tigrinya, Gafat, and Gurage have 
ambipositions and prepositions, Harari, Agau, Galla, Beja, and Afar 
have postpositions. Somali exhibits another new category: the 
equivalent particles are attached to the verb rather than to the 
noun to which they refer, which may occur anywhere in the 
sentence-generally, however, somewhere before the verb due to the 
OV character of the language. 


Examples (21) 


wada wanz-‘towz2rd the river’ (Amharic) 

ba aalgaa taach-under the bed’ (Amharic) 

‘ali la’agta ka gaad-‘Take the money from Ali (=Ali the-money 
from take) (Somali) 


A somewhat similar phenomenon is found in Kurdish, except 
that the ‘‘prepositional’’ element is suffixed rather than prefixed to the 
verb: haatim-aa shaarii - ‘I-came-to town’ (Kd.) 


The reverse effect we have come by now to expect on the 
other side of the VO/OV border is exemplified by Georgian and volga 
Finish (Cheremiss and Mordvin) which exclusively postpositional, and 
perhaps Swahili, which has a locative suffix -ni. Here we may also 
put the retention by Hungarian and Baltic Finnish of their post- 
positional character, deeper within the VO zone. In the vicinity, 
Latvian, Lithuanian, and even German have a few postpoitions 
(entlang, entgegen, gegeniiber, halben, halber, wegen). Finnish 
and Estonian have also developed a few prepositions, to blur the 
transition still further. 

We have as yet mentioned no comparable shadow effects across 
the eastern border. However, Chinese isofinterest here. It presents 


ijdl Vol. III No. 1 170 Masica 


certain classificatory...because first of all descriptive--problems (such 
as Thai, Cambodian, Malay, and other Southeast Asian languages 
do not). Relations such as those expressed in other languages by 
prepositions or postpositions are expressed in Chinese partly by noun- 
like words which follow the noun (wadzii - ‘in the room’, chwangsya- 
‘under the bed’, chyangshang-‘on the wall’), partly by verb -like 
words which precede it (dzai jyi-‘at home’ yan lu - ‘along the road’, 
gén wo-‘with me’). Just as the former may conceivably be rendered 
“‘room-inside’’, ‘“‘bed-bottom”’, ‘‘wall-surface’’, so the latter may be 
seen, especially in context, as retaining their verbal meanings 
(“located -home’’, ‘edging -road’’, ‘‘following-me’’; e.g. gén wo 
14i = “‘follow-me-come’”’ = ‘come with me’; dzwo féiji chyu = “‘sit- 
plan-go’”’ = ‘go by plane’.) 


Complicating the matter further, there are members of the 
verb-like class which seem to have no ordinary verbal function, such 
as tau’ng ‘from’, as well as an undoubted postposition in the “‘genitive’’ 
particle de (but is English ’s a postposition?) Finally, members of 
the two classes frequently occur together (dzai fangdzde héubyar- 
‘behind the house’ — located-house’s behind), analogous to the situ- 
ation in Pashto and Amharic. Perhaps the fairest thing is to call 
Chinese ambivalent, Po/Pr. If the verb-like forms are taken to be 
prepositions (or something other than veabs), the effect is often to 
leave another verb alone at the end of the sentence, thus pushing 
Chinese somewhat away from the VO-Pr norm in any case. 


The Lolo language appears to be postpositional (hi ka-nyi - 
‘behind the house’), while Khamti and Shan as well as Khasi and 
Palaung are prepositional, The line at this point therefore, unlike 
the OV one, neatly follows the genetic one, Tibeto-Burman vs. 
Tai/Mon-khmer. 


(D) The various modifiers of the noun which pattern alike 
in our language sample turn out to be separate variables when we 
trace them further. 


(i) The feature Adj + N defines an isogloss at considerable 
variance with OV, though still circumscribing the same general area. 
The main differences are a wide flaring out in the north, to include 
the Slavonic and Germanic languages, Chinese, Finnish and Hung- 
arian, Georgian, and, less securely, Greek, in addition to the OV 
languages, and a marked narrowing at the waist, to exclude Persian 
and Thjik (though not Ossetic, Pashto, or Balochi) on the one hand- 
also Kurdish and Tibetan, Garo, the Naga dialects, Khamti and 
Shan (and of course Khasi and Mon) on the other. Somali Eskimo, 
and Alevi) are also excluded. Burmese, Manipuri, Lushai, perhaps 
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Kachin and some other Tibeto-Burman languages waver. That is, 
to take the Burmese case, modifiers of nominal origin precede (sa 
dai’ —- “‘/etter building’’ = ‘post office’), while those of verbal origin 
follow (‘nwa pyau’ -‘lost bullock’, unless they have to do with 
‘fuse or cause’? (-au’ ye - ‘drinking water’) or use the particle -au; 
phrasal modifiers also precede, except for a special list, which follow. 
This confused situation partakes partly of the problem of defining 
adjectives or adjectivals in the first place, partly of the transform- 
ational nature of attributive adjectives, and partly of the fact that 
we are in a border situation. Two of the early insights of generative 
grammar, namely, that attributive adjectives are derived transfor- 
mationally from predicative ones, and that languages differ more in 
their transformational structure than in their basic phrase structure, 
seem to be born out when we compare the Adj + N isogloss with the 
OV and similar isoglosses. 


Greek, which permits adjectives in postnominal position 
under special conditions, may be regarded as transitional to Romance 
(N + Adj), and the latter, permitting prenominal adjectives under 
special conditions, as transitional in turn to the more rigid N + Adj 
order of Arabic and Berber. 


(ii) The Gen+N isogloss follows OV more closely, including 
Chinese, Georgian, and Finnish but stopping short of Slavonic. 
Persian, Tajik, Kurdish, and Somali are excluded, but Tibeto-—Burman 
remains unambiguously inside with IA and Drav. A slight shadow 
falls on otherwise excluded Greek, in the case of noun phrases 
containing other modifiers, or genitive phrases themselves containing 
modifiers (pime stis adelfin mu to spiti-‘Let’s go to my sisters 
house’). 


(iii) Dem-+N excludes Somali and Tibetan, as well as Thai, 
Indonesian, Swahili, Tagalog, Cambodian, and related languages. 
At two points along this mainly southern boundary, namely with 
Tigré and Tigrinya and with Malagasy, the typically border 
phenomenon of ambifixation occurs (dem+N-+decm). 


(iv) Num +N cuts out all Tibeto-Burman languages, 
Andamanese, and, most significantly, Sinhalese. The numeral may 
follow the noun under certain conditions or implications even in 


Tamil. 


(E) Ordinary adjectival qualifiers follow the adjective only 
in extreme right-branching languages: Arabic, Swahili, Thai, Cam- 
bodian. Javanese. 
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The qualifier of comparison, however, yields an isogloss 
useful in distinguishing languages of the SA type from mixed-type 
as well as right-branching languages. The SNA trait characterizes 
allIA and Dray. languages (including Sinhalese, Kashmiri, and 
Brahui), Munda, Tibeto-Burman and Altaic languages, Korean, 
Japanese, and Finnish, and finally Somali. Then follows a transitional 
type MSA, in Amharic, Balochi, Pashto, Tajik, Chinese, Khasi 
and partly in Persian. (Colloquial Tajik has also developed an 
SMA form.) Beyond these arethe AMS types of Europe (such as 
English), as well as of Arabic (and competing in Persian) and of 
further Southeast Asia, and unrelated constructions in Africa, 


(5) Conclusions 

It can be seen that the successive isoglosses of surface order 
features common to a sample of SA languages do define a definite 
area of the Old World, and ultimately set off within it a smaller 
region very closely coincident with what we might take a priori 
to be the “Indian linguistic area’’. This area is moreover not an 
area of transition (formed by the intersection of isoglosses), but a 
trait-core area, surrounded by concentric isoglosses. 


(A) First, the basic OV isogloss defines a massive middle 
segment of the world, a largely left-branching syntactic zone in which 
India is the main southern anchor. In keeping with its basically 
north-south trend, we might name this macrozone INDO-ALTAIC. 
It is flanked by two equally well-defined VO zones, the Euro-African 
and the Sino-Indonesian (perhaps Austroasiatic would be a better 
term), within which Greenberg’s category VSO appears in isolated 
fragments as an extremist subvariety. 


(B) Subsequent isoglosses, especially those involving the 
structure of the noun phrase, cut the OV belt almost in two at its 
waist and isolate the subcontinent as a separate sub-zone. 


(C) These subsequent isoglosses at the same time define three 
transitional subzones, where the basic OV syntax begins to give way 
to opposing phenomena: the Iranian, the Tibeto-Burman, and the 
Abyssinian. Of these, the latter two show the closer typologica] 
relationship to India, despite the genetic affinity of the first. This is 
especially apparent with regard to relative clauses, not used asa 
criterion here. To these three, we should perhaps add a fourth 
transitional zone across the VO boundary—China. 


(D) A thick bundle of isoglosses separates India from 
Southeast Asia beyond Burma, from Arabic, and from Africa beyond 
Ethiopia. 
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TABLE I 
Hindi, Telugu Thai 
Burmese 
OV VO 
Po Pr 
gen + N N + gen 
dem + N N + dem 
V + aux aux + V 
standard + marker + Adj Adj + marker + standard 
qualif + Adj Adj + qualif 
TABLE II 
Hindi, Malayalam Arabic 
OV vo 
Po Pr 
Adj + N N + Adj 
gen + N N + gen 
standard + marker + Adj Adj + marker + standard 
TABLE III 
Malayalam Amharic Swahili 
OV OV 
Po | Ambi/Pr Pr 
Adj + N N + Adj 
gen + N N + gen 
rel + N N + rel 
num + N N + num 
dem + N N + dem 
standard + marker / marker + standard 
+ Adj + Adj (verbal) 


(tense prefix) + V 


V + aux 
Adj + qualif 


qualif + Adj 
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There is more affinity with Europe and with China than with 
these areas. 


TABLE IV, V. P. 175 


(E) Of the two anomalies within the subcontinent, Khasi and 
Kashmiri, the former is only slightly intruded from the territory of 
its typological allies on the east, but the latter, although near the 
point of juncture of the Iranian and Tibeto-Burman transitional 
zones—defined in particular by N + Adj, a peculiarity it does not 
share—is far from any VO language. If retention of genetic heritage 
is the explanation in both cases, it can be noted that the Khasi Hills 
and the Kashmir valley share a notable degree of geographic and 
historical isolation. 


(F) On the northwestern border, Pashto and Balochi con- 
stitute a further transition between India and Iran proper-partially 
assimilated Iranian. 


(6) Speculations and comments 


These facts are so at variance with some cultural historical 
ones as to call for some comment. India has, for instance, exerted 
prolonged and intense cultural influence on Southeast Asia, yet the 
syntactic effects, as defined here, seem to have been nil. 


TABLES ie F-17175 


Moreover, direct linguistic influence was not the least among 
these influences, as evidenced by inscriptions, by a surfeit of loan- 
words from Sanskrit and Pali, and by the conventions of writing 
(Mon, Khmer, Thai, Lao, Javanese, and apparently, some Philippine 
languages are among those which have or had an Indian-derived 
script but no syntactic affinity with India). 


A parallel situation exists with regard to Chinese influence on 
Japan, Korea, and Vietnam, and, with some qualifications, with ° 
regard to Perso-Arabic influence on India itself—cultural domination 
for long periods, much lexical borrowing, on script, but little or no 


influence on crude syntax beyond the internal grammar of the loans 
themselves.” 


TABLE VII, VIII. P. 175 


It is not that the features in question are stonily unamenable 
to alteration—or require more time—cf. European Romany, or 
Hungarian. The problem is not without relevance for the question 
of the genesis of linguistic areas. They have usually been explained 


Masica 
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[pV ‘W'S xne+A N-+008 N+!{pv Od AO IpulH 
‘STWw/(py [pV 'S‘W Atxne/xne+A = Ue3-+N (PV+N Id (OA) AO ——URISIOd 
INA ATVL 
A+xne wop-+N uss-+N [py+N Id OA esoulvuyorA 
xne-+ A N+ wep N+ 003 N+{pv Od AO uvs10O Fy 
A+xne N+ wep N-+u03 N+f{py  4d/0d OA esouryD 
JIA @VL 
Jyenb+f{py wnu+N/N+unu wept+N uss +N [py+N Id OA gsouvavf 
jyenb+fpy wnut+tNn wop+N N-+ua3 N+fpv Jd OA iu y 
N+ 11 9 fpytjnenb N-+unu N+ wep N +008 N+{pv od AO nsnpeL 
IA ATAVL 
N+P fpytjrenb N+ uu N + Wop N+ 08 N+[pv 1g/Od OA esoulyD 
N+pr  fpy+jrenb  N-+unu N + wep N +003 N+!{pv Od AO nsnjaL 
PI+N fpy+jsyenb = N+unu N+ wep N+ u0s N+ {pv Od AO Iesucg 
7 A Alavi 
[py +jiyenb PIt+N N-+wnu N+ wap uos-+N (pV+N Id (A9) OA yousl 
[py +juenb PI+N N+wnu N+ aap us + N N+ pv (Od) Jd (AO) OA uBUIIO 
[py +jienb PI+N Nt+unag N-+ wep ues +N N+!pv Id OA uvISSNy 
[py +jienb II+N Nt+tunu N+ wep N+ ues N+ {pv id OA YSIPEMs 
Ipy +yienb PI+N Nt+unua N+ wep N+ 003 N+ {pv Od AO IpulH 
[py +gienb N+]. Nt+tunu N-+ Wop N+ 008 N+ {pv od AO nsnyIL 
AI ATEVL 
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as the product of extensive cultural intercourse, and/or by domination 
by a particular higher culture. Thus Sandfeld (1930: 214-215) 
attributes the formation of the Balkan linguistic area essentially to 
Greek (Byzantine) cultural domination. 


This theory has already suffered from the Sanskrit/Dravidian 
confrontation in India proper. We suggest that the evidence of the 
Indian and Chinese cultural colonies is conclusive: domination by a 
ruling culture, even for prolonged periods, is insufficient by itself to 
produce convergence of the sort under discussion here. 


Direct participation in the exchange may be confined to an 
elite, in which case borrowing may be selective and predominantly 
lexical. A more intimate, and less structured, sort of intercourse 
(involving not only extensive bilihgualism but common illiteracy?) is 
required to break down and reform structures in the manner chara- 
cteristic of a syntactic convergence area—an intercourse perhaps well 
illustrated in modern times by the wanderings of the Gypsies. 


Proper conditions may thus occur at any time, although they 
were perhaps more common at earlier periods of history. Examples 
generally involve the irruption fo a language from outside into an 
established syntactic zone and its subsequent assimilation or partial 
assimilation: Indo-European and Munda by Dravidian, Semitic by 
Cushitic, Tai by Tibeto-Burman-all of which oddly enough involved 
the same kinds of changes, more radical in the case of Semitic and 
Munda than in that of Indo-European. 


But what of the formation of the syntactic macroregions 
themselves? Such regions appear to be long-established and 
relatively stable-apart from such modern phenomena as the dis- 
placement of languages by the colonial expansion of Russian, 
Chinese, English, or Spanish. Are they just a coincidence? It has 
been objected that correlations involving only two variables, such as 
Object and Verb, cannot be very significant, since only two possi- 
bilities exist, each presumably with a 50-50 chance of occurring. 
This ignores the fact of their massive and overwhelming areal 
patterning, their correlation with other variables, and the possiblity 
of other, compromise alternatives, which characteristically turn up 
along the border between two macroregions. Whatever their 
explanation, the syntactic macrozones do exist as significant 
entitites. 


Any racial hypothesis is ruled out, as the zones cut across a 
bewildering number of racial lines, as they do climatological and 
other lines. 
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Resemblances between some of the components of Indo- 
Altaic, such as those between Altaic and Dravidian, were of course 
noted at least a century ago by such scholars as Caldwell, who 
tended to give them a genetic interpretation. Even the Ethiopian 
appendage was noted indirectly by Praetorius (1879): 


“War sich mit den hauptsidchlichsten anderen semitischen 
Sprachen etwas eingehender berchaftigt hat und dann zum 
Amharischen iibergeht, dem muss notwendigerweise... 

zunachst und am nachdriicklichsten ins Auge fallen, 

dass alles was wir aus den iibrigen semitischen 

Sprachen als Princip der semitischen Syntax erkannt 

zu haben glauben, im Amharischen sich nich etwa 

blos in volliger Aufléstng befindet, sondern zum 

Teil gradezu in das Gegenteil umgeschlagen ist. 

Wir empfangen den Eindruck als habe ein Tarke aus 
seiner Sprache unter Beibehaltung der tiirkischen 

Wortstellung ins Semitische iibersetzt...’’ 


For “Turkish’’, he could have substituted the more relevant ‘“‘Dra- 
vidian’’, had he been aware of that dimension. 


We are thus merely filling in and putting together, and stressing 
contiguity of all these phenomena. 


We can only speculate that prehistoric conditions may have 
favored contacts among these presently widely-stretched peoples, 
and the intimate commingling on a large scale of smaller populations 
that could produce the results discernible to this day. Once these 
special conditions, whenever they might have occurred, were replaced 
by more stable ones, perhaps the traits thereby acquired became 
fairly resistant to further change, barring migration into another area. 
Subsequent historical vicissitudes, whether bringing influences in 
harmony with the established territorial norm, as in the Balkans, or 
in conflict with it, as in Southeast, Asia or Korea, apparently rarely 
approached a re-creation of these conditions to an extent that greatly 
affected the established syntax. 


Such significant early contacts between the several Indo-Altaic 
subregions are not entirely implausible. Let as consider the Ethiopian 
appendage, for example. According to the geographer of plant 
origins, (Carl Sauer (1952), the Ethiopian highlands have been 
discovered to be “‘one of the world’s oldest and greatest centers of 
domestic seed plants.’’® | Here was the ancient source of bajra, ragi, 
and other millets, of sesamum (til), of anumber of pulses, and 
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possibly of cotton itself—all of them specifically basic to the folk- 
economy of western India. Contact may have been by sea, or as 
Sauer proposes, via the little-known southern coast of Arabia. (A 
modern-day descendant of South-Arabic, Soqotri, shows certain 
syntactic agreements with Ethiopia and India, for example in relative 
clauses. See Leslau 1945.) 

Can the formation of syntactic macroregions be linked to the 
establishment of settled modes of economic life? 


We might propose even wilder theories, but let us return to 
our conclusion: it has proved possible to define the Indian linguistic 
area on the basis of syntactic criteria alone. Whether criteria of 
other types would confirm or contradict this definition or even 
pattern at all is a question for further study. 


NOTES 


1. In this paper, Bach incidentally lists Hindi among ‘languages of 
Type III’? showing no agreement between object and verb, but 
Hindi does have such agreement in the so-called ergative or 
ne-construction. 


2. Bray (1909, 1934) being difficult to obtain, I have had to rely 
temporarily for Brahui on Konow 1906 and Caldwell 3rd ed 1913 
(repr. 1961). Cf. Also Zograf 1960:93-4. 

3. Although conscious of its inadequacy, I might nevertheles 
give the results here of a spot check involving short stories or 
passages by Hindi-Urdu and Telugu writers. 95% of the Telugu 
and 72% of the Hindi-Urdu constructions had the object 
before the verb. Of the contrary cases, most involved clausal 
objects, especially quotations (‘‘He said that...’’, ‘He thought 
that...””> or even ‘‘He_ said,’’ -the latter doubtless in 
imitation of written English and preferably excluded from the 
statistics altogether, thus removing most of the T counter 
examples.) Only one T example involved a true noun phrase 
object: testaaDu Dabbu--‘He’ll bring the money’. In the H 
sample there were six of these. These were augmented by 
postverbel clausal objects in ki (see Note 11 below), something 
which is normative in H but quite foreign to T structure. Also 
of interest may be the fact that the subject, where it was 
mentioned, followed the object (order structures OSV or OVS) 
in 53% of the T cases, but only 17% of the H cases. This 
may merely reflect the stylistic predilections of the authors 
(Southi Krishnamurti and Kommuri Venugopala Rao Vs. 
Premchand and Mohan Rakesh), but I suspect it reflects a 
a real stylistic difference between the two languages. 


wr 
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4, 


Eee sentences all have the order OV (GV, CV, etc.) 
only. 
“‘Adjectives’’ as a special word-class are of course absent from 
many languages. It will be necessary, in seeking a more 
universal category for the comparative syntax of the noun 
phrase, to fall back on something like “adjectival”, defined in 
terms of derivation by transformation from a certain subtype 
of predicate. 
Cf. Pinnow 1960. A Sora example is: 
(free) aninji Ji:n ga:teji ‘they /liqguor/drink’ 
(bound) ga:-sa:/]-tenji ‘drink-Jiquor (they +tense)’ 
(from Ramamurti 1931 : 156) 
However, the Hindi-Urdu clausal object (or complement) con- 
struction involving the Persian conjunction ki must owe 
something to Persian influence in the Islamic period, as may 
the Bengali, though it uses the native element je: 
tum pahle bhii jaante the ki jagah sthaayii nahit hai- 
“You kneweven before that that position wasn't permanent’ (H) 
ami jantam na je se phire eseche-‘I didn’t know that he has 
returned’ (B) 
Senidam ke uu xaahad aamad —‘I heard that he will come’(P) 
(Cf. Telugu: miiku iTiiwala udyoogam dorikind-ani, naaku 
telisindi- ‘I found out that you got a job recently’ ) 


Whether the inversions of head and modifiers common in Hindi 
Urdu poetry are due primarily to Persian influence is a subject 
for further investigation. 


Sandfeld’s theory (See Sandfeld 1930: 214-5) has the merit 
some of these accommodations are known to have occurred 
during the Byzantine period However, the changes (such as 
the loss of the infinitive) often involved Greek also, which 
emerges as much a victim as a cause. 

See Sauer 1952*. I must thank Mrs. Claudia Turner for 
calling my attention to the fact that Sauer 1952 is not exactly 
the last word onthe subject. Further investigation on my 
part has revealed a considerable measure of disagreement bet- 
ween the archaeologists and the botanists, although the 
objections of the former are weakened by the fact that much 
of Ethiopia is admittedly unknown archaeologically. The 
consensus seems to bethat, if Ethiopia was not a major 
center of domestication (of wheat, barley, and cotton) as 
Sauer or rather Vavilovs would have it, it was at least a 
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minor center, and certainly a center of diversification. More 
important, it was either the site or very near the site of the 
domestication of jowar, bajra and ragi (Sorghum vulgare, 
Pennisetum spicatum and Eleusine coracana), and the source 
of their transmission to India, sometime after 1500 B.C. — 
which is what primarily concerns us here. Harris notes that 
no general theory has been advanced to replace Sauer’s. 


For further references, see Clark 1962*, Giiterbock 1968*, 
Harris 1965*, 1967*, Kraus 1968%*, Murdock 1959*, Porteres 
1962*, Seddon 1968*, Simoons 1965*. 
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BILINGUAL’S GRAMMAR: A CASE STUDY OF 
TAMIL - SAURASHTRI NUMBER NAMES* 


P. B. Pandit. 


The number of languages in a country is not always an index 
of barriers of communication in that country. A _ multilingual 
country should not be interpreted as mere juxtaposition of a number 
of monolingual territories. In a large multilingual country like India, 
continuous movement of people and ideas over along period has 
resulted in a diffusion of linguistic features across language families. 
One of the significant features of multilingualism in India is the 
existence of stable bilingual or multilingual communities. People in 
large metropolitan centres or district towns maintain their language 
identity for generations, despite its ‘minority’ status. They speak 
their own language in their domestic settings and such other dealings 
where the speakers of ‘minority’ language come in contact with each 
other and they speak the majority language in other contexts. This 
continuous code-switching has resulted in striking similarities in the 
grammatical structures of these languages at colloquial level. Tamil- 
Saurashtri bilingualism is such a case of linguistic convergence under 
conditions of stable bilingualism. 


Tamil belongs to the southern group of the Dravidian 
languages family. Saurashtri belongs to the western or south- 
western group of IA languages.1 It seems that this IA speaking 
community migrated from the west coast of Saurashtra (Gujarati; 
western Indo-Aryan) some time in the Ilth century, stayed in 
Marathi (south-western IA) speech area for about two centuries, 
and then moved further south, first in Telugu speech area and event- 
ually settled down in Tamilnadu after the 16th century. The 
southward migration of this IA speech community was mainly due 
to the religious persecution of the Muslim invaders. 


* T am grateful to Austin Hale of the Summer Institute of Linguistics (Nepal) 
for comments and suggestions. 
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Traditionally, the Saurashtris are a community of weavers, 
chiefly silk-weaving, (at present, the textile and handloom industry). 
According to the 1961 census the speakers of Saurashtri number 
155392: they are mainly concentrated in the districts of Selam, 
Triruchirapalli, Tanjore, Madurai and Ramanathapuram in Tamilnadu. 
About 51 percent of the Saurashtri speakers are concentrated in the 
city of Madurai. 

The Saurashtri speakers tend to cluster together in residential 
districts, they also have different customs and dress (especially 
women), normally they do not have marriage relations with Tamil 
speakers; these are some of the factors which have helped the 
language maintenance. Saurashtri does not have script; besides 
some ritual occasions, there is hardly an occasion to write in 
Saurashtri; Tamil script is employed (with diacritics) to write 
Saurashtri; Saurashtri is not taught in the schools; all Saurashtri 


children go to Tamil schools and for all practical purposes of 
written communication Tamil is used. 


The speaker’s choice of language in such a situation is 
governed by social factors; this is a special case of diaglossic situa- 
tion, though the languages belong to two different families. A study 
of the phonological and grammatical system of Saurashtri indicates 
that the grammatical storage of the bilingual speaker in such a 
bilingual community may consist of one set of rules applicable to 
both the languages.5 The syntactic structure and the morphological 
categories of the two languages are remarkably identical. i have 
elsewhere shown (Pandit 1972) that Saurashtri grammatical structure 
closely resembles the Tamil grammatical structure. Historically, 
Saurashtriis an JA language. The lower level rules i. e., lexical 
entries are different for Tamil and Saurashtri; but the syntax is 
largely identical. It is also noticed that a Tamil speaker picks up 


colloquial Saurashtri in about four weeks stay with Saurashtri 
informants. 


The set of number names in a language can be easily detached 
and analysed independently. The grammar of number names of 
the three languages Gujarati, Tamil and Saurashtri is presented 
here. The Gujarati and Tamil grammars are presented to bring out 
the general similarities and differences between IndoAryan and 
Dravidian number systems. The base rules for both are almost 
identical; but Tamil and Saurashtri number systems are more 
identical; they can be generated by the same model, though the 
terminal vocabulary in both is different. The rules for the grammar 
of Gujarati number names are presented in details and they are 
followed by Tamil and Saurashtri rules. 
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The set of number names consists of a limited lexicon and a 
limited number of rules which produce the infinitely denumerable 
set of number names. The rules must produce only the acceptable 
number names, and they should also provide for possible synonymous 
expressions of the number names, e.g. the following three are 
synonymous in Gujarati:3 


ponder so | ‘fifteen hundred’ 
dodh hojar ‘one and a_ half thousand’ 
ek hajar pac so ‘one thousand and five hundred’ 


The above number-(1500) can also be stated by merely 
reproducing the digits in the order in which they are written-thus: 
ek-pic-midii-midti- (one-five-zero-zero-); but this is not a number 
name, it is a dictation label for number names. We have therefore 
not taken into account this type of number description. 


For all practical purposes the highest number name unit in 
Gujarati is abaj i.e. 1000000000. Units above this can be made 
available from any Sanskrit lexicon, but they are never used; we 
have therefore stopped at sbaj in our lexicon. 

The conventions in generative grammar regarding the pre- 
sentation of rules have been observed. Thus, — , ( ) and 

} respectively indicate ‘rewrite as’, optional inclusion and the 
obligatory choice of included elements; : is used to indicate introdu- 
ction of terminal symbols.——~—— is used for T rule (transformation 
rule) as well as for morphophonemic rule; as we have only one T 
rule in this grammar, there is no chance of confusion. I have 
followed A. Van Katwijk’s model ‘A Grammar of Dutch Number 
Names’ (Foundations of Language, 1.1. 1965) with suitable alter- 
ations for describing the Gujarati number names. 


GUJARATI 
Non-terminal vocabulary: Ry, R,; Re Rs Rg 
D1. D2. D2 D* 
Yi Yo Ys Va 
1 Xl Xa 


Terminal vocabulary: midi (sun) 
ek, be, tron, car, pac, cha, 
sat, ath, nov (1-9) 
dos, ogyar, bar, ter, covd, 
ponder, sol, sottor, 
adhar, ognis (10-19) 
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vis (20) 

So (100) 

hajar (1000) 

lakh (100000) 
korod (10000000) 
aba] (1000000000) 


pa (1/4), ordho (1/2), pono (3/4). 


Production rules: 


(3 zero 4) 


| te | 
1, Integer + »6([3 J) \ 
lye (Ha) Ys | 


LR; J 
Of the four possible applications of this rewrite rule, the first gives 


us zero (midi-sun); the second and third supply us with the alter- 
native synonymous expressions referred to above; these three will be 
taken up later. The fourth, R; can be expanded as follows: 


2. Ri—> ((¥4) ¥3 D1) R,-1 
Rule 2 is recursive so that R1 (i.e. Ro R, Re Rs and Ry) 
can be introduced again on the left, e. g., 
Ri ( (Ya Hs D4) Ry 
Rs—— ( (¥4) ¥3 D%) Re 
Ro— ( (¢4 ¥3 D?) Ry 
R1—> ( (¥4) ¥3 D1) Ro 


Sos seer ae (v4) Y3 


4. y— (IC) 
xl 
XX X | 
I 
Xx 
ee (4 x ) 
XX (1) 


~~] 
ea 
rs 
em, 
HE] CO RO] pd 
a” 
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Now we can introduce the lexicon. The entries should be in 
braces and when there are more than one item they ought to have 
been arranged vertically, one above the other e.g., 


ek 


to indicate obligatory choice of the included elements. We have, 
instead, omitted the braces, and entered the items in a conventional 
horizontal line for printing convenience only. 


I : ek, be, tron, car, pac, cha, sat, ath, nov 

X : dos 

XI : das + ek... nov 

XX(I) : Pa fo) eae See nevu (+ ek......... nev) 

C : SO 

D1 : hojar 

D2 ; lakh 

D3 : korod 

D4 ; abaj 
Morpho phonemic rules: 
pa + ek ++  sova 1i 
pa + be —+— sava be 2+ 
upto 
pa + novvantl + Sova novvanu 993- 
ordho + ek po dodh 14 
ordho + be + odhi 24 
ordho + tran — + sada tron 34 
ordho + car + sada car 4 
upto 
ordho + novvant as sada novvantl 994 
pono + ek + ponoi.e. ‘a quarter less from one’ 
pono + be eee pona ts ‘a quarter less from two’ 12 
upto 
pono + novvanu + pona novvanti ‘a quarter less from 


ninety nine’ 982 


Note that of the three fractions, pa, ardho and pono, the last two, 
ardho and pono are variable adjectives agreeing with gender of the 
following noun; if the noun is followed by a postposition, they are, 
as variable adjectives, represented by oblique forms ordha and pona. 
These are the rules of Gujarati grammar, applicable to variable 
adjectives and nouns, and hence they are not discussed here. 
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T Rule: 
e.g. vis + ek —+— ek vis 


vis + be —+-— ba vis 


etc. with appropriate morpho phonemic rules such as: 


pet + vis —+—+  bavis (22) 
tron + vis —+— trevis (23) 
car + vis —+—+ covis (24) 
ath + vis —+-—+ otthavis (28) 
be + tris —+—  botris (32) 
pac + tris —+—+ patris (35) 
be + sitter —+—  boter (72) 
tron +nevu —+— trand (93) 


etc. etc. which would cover the numerals from 21-99 (other than 
The tens, i.e. 10, 20-302. 90). 


Note that the set of numbers 19, 29, 29, 49, 59, 69 and 79 consists 
of the element ogan: ‘one less than’ 


dos + nav —+— ognis ‘one less than twenty’ (19) 
vis + nov —+—+ ogontris ‘one less than thirty’ (29) 
tris + nov —+-—+ ogancalis ‘one less than forty’ (39) 
sitter nov —+— ogonési ‘one less than eighty’ (79) 


The set of numbers 89 and 99 consists of the elements 80 + 9 and 
90 + 9 respectively, 
esi + nov +—  nevvasi ‘eighty and nine’ (89) 
nevu + nov —+— novvani ‘ninety and nine’ (99) 


We can now take up the base rules (production rules) which 
produce (i) zero and (ii) synonymous number names, 


(i) Integer —+— zero 


Zero : sun, midi 
is self explanatory. 


i Ye 
(iia) Integer ++ wy ( ) 
Y1 


This rule blocks the ungrammatical number names such as: 
“sada car so pac 
*sova nov so sat 
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When ¥, precedes either ¥. or ¥,, the occurrence of v3 is blocked by 
the above rule. The above rule produces the strings 


( Sova bar so (1225) 
Ys Yo —> 4 sada pScasiso (1550) 
Pona paccis so (2475) 

sova pac so (525) 


i Sada nov so (950) 
pona nov so (875) 


Note that this rule also blocks ungrammatical numerals *sada tris so 
because the absence of ¥3 prevents XX (I) to appear here, wv. permits 
only XX+I; thus the tens (i. e. das, vis, tris......... nevu) are blocked- 


(jib) Integer—> po (v4) vs 
This generates strings such as: 
pondor so tris (1530) 
novvant so nevvani (9999) 
Note that the absence of initial ¢%4, blocks ungrammatical strings 
such as: 
*sada pondor so batris ; 
but the introduction of (v4) medially, permits strings such as: 
pondor so sada botris (15323). 
Note that all the strings generated by this rule can also be generated 
by R, rule; thus the numbers such as: 
pondor sa batris ‘fifteen hundred thirty two’ 
ponder so sada botris ‘fifteen hundred thirty two and a half’ 
can also be expressed as : 
ek h¢jar pac so botris ‘One thousand five hundred thirty two’ 
etk h¢jar pac so sada botris ‘One thousand five bundred 
thirty two and a half’ 
This rule, then, is an optional production rule which allows certain 
numerals to be stated in an alternative way. 
Some comments on the main base rules may be in order. 
1. Integer —~> R, 
Zi Ry Sara ( (4) ¥s Di Re 1 
This, as stated earlier, is the recursive rule. R, can be expanded 
as R, Rs, Re, R; and Ryo. Ro cannot be recursive - it does not 
appear again on the right. This prevents infinite repetitions of 
numbers in a string. 
Je RG —> 1: (4) Ve 
This will block strings such as: 
*sada cho so paccis, 
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by (4) placed before 43 we can get: 
cha so sada poccis (6254), which isa grammatical numeral. 


The above rules of derivation can be illustrated in a tree 
diagram as follows: 


Re 


7 Yer 
Yq V3 DB R3 


We 
w,: hundreds D1: thousands 
v2: tens+hundreds D2: hundred thousands 
wg: tens-+ones D3: ten million (American) 
%4: fractions D+: billion (American) 


All illustrative examples, i.e. Gujarati number names, would 
be sub-expansions of this, e.g. ekotris obaj, tran karod, ognis hojar, 
sada nevyasi (31; 03,00, 19,0893). 


Saacuie:. 1 > 

<Rule 2> 

<Rule 2> 

<Rule 2> 
ewe <Rule 3> 
« ; t Ry 


Vy" Xe “ 
XXI obaj I korod XI hojar IC ardho XXI <Rules 4-7> 
{ } | (Terminal string) 
IXxX IX <T-Rule1l> 
The step by step application of the above rules to derive the 
above number name is as follows: 
1. Integer Ite R; 
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Ri > Gta) DY) R; - 1 
desist oN (¥4) ¥3 D*) Rg 
Ys —- XX (1) 
T rule 1 >: XX (I) +> (1) XX 
IXX : ekoatris 
D4 6OD9] 


We have, thus, derived ekatris sbaj from w3; D4. We can now 
expand Rs. 


Rs reg ((¥4) ¥3 D8) Ro 
v3 Se I 

I tron 

D3 ; korod 


We are, thus, working down step by step, from the highest unit to 
the lower onces. We have now, ekoatris abaj, tron korod. Expanding 
the R, 
Rg fete. Ys D2) R, 

Items in parenthesis are optional, we take the advantage of this 
choice and do not expand ws, or D? to the terminal symbols because 
we do not need them (we do not need any D?: lakh, in this number 
name). We, thus, proceed further. 


Ry — ((44) ¥3 D1) Ro 
Ys See xl 
XI ; ognis 
D} : hojar 
Thus, ekotris abaj, tron korod, ognis hojar. 
Ro Ter ata: ¥, (4) vs 
wy — (IC) 


We do not resolve this parenthesis, because we do not 
require to expand a C. 

Note that in all the previous rewrite formulae we chose not 
to resolve the parenthesis containing (4) the fraction. We did 
not require it. Now, we require it, so we resolve the (¥4) in this 
rewrite rule. 

W4 —> erdho 
Later on, by an application of a morpho phonemic rule we arrive 
at sada in place of ordho. 


T rule XXI -—— IXX 
IXX —> novyasi. 
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Non-Terminal vocabulary : 
R O Ry 


Terminal vocabulary : 


I: onnu, rendu, muunu, naalu, anju, aaru, 


ombadu (1-9) 


eelu, 


Pandit 


ettu, 


X : pattu, irubadu, muppadu, naappadu, aymbadu, arubadu 
elubadu, embadu, tonnuuru (10-90) 


X 1. : padinonnu 
pannirendu 
padimuunu 
padinaalu 
irubattonnu 
irubatterendu 


(11) 
(12) 
(13) 
(14) 
(2%) 
(22) 


irubattumuunu (23) 


similarly, upto 


cS : nuuru 
Di : aayir6 
D2 : laccé 

D3 : koodi 


Production rules: 


1. Integer —~> 


pad 2c) — 
Rs —> 
Re > 
Ri —_ 
3. Ro —- 
4. wy —- 
5. the —> 


(99) 


(vo D1 Q) Ro 
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Morphophonemic rules : 


Q functions as a connector. Qis realised as tu, ttu, or ee 
according to the differing environments: 
nuuru + Q —>——  nuttu 
aayir6 + Q —>— = aayirattu 
lacc6 + Q —~+>—— laccattu 


? 


Forms such as aayirattu ‘thousand...’ or laccattu ‘lakh...... , 
occur only when they are followed by lower numbers e. g. 


nuuru + pattu——>—— nuutt (u) pattu ‘one hundred and ten’ 
aayiro + nuuru—~>—— aayiratt (u) nuuru ‘one thousand and 
one hundred’ 
lacc6 + irubadaayir6 —-»—— laccatt (u irubadaayird 
‘one lakh and twenty thousand’ 
Further, the elision of (u) in the above will be stated by the general 
sandhi rules of the language. 
Note that the connector tu is selected only when it connects 
C, D1, or D2 numeral (100, 1000 or 100,000) with some other 
lower numerals. 


We can state this rule as: 
C Cc 
1. D1 +Q +R, —— 1{D!} +tu+Ro 
D2 D2 
The other connector ee is selected when the numeral koodi is 
used. The following rule states the use of ee 
2. D3 +Q+R, —-——> D3,+ceR, 


e.g. pattu koodi anji lacc6 ——~—> 
pattu koodiyee anji laccéd: 


‘ten crores and five lakhs’ 


oru koodi irubadu laccé —~»—~> 
oru koodiyee irubadu laccd: 


‘one crore and twenty lakhs’ 
ee is also selected when the fractions kaal ‘}’ and mukkaal ‘%’ are 
added : 


Pandit 
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kaal 
ae O+ | iukkel ee 
ee kaal 
pe | nakkan 
e.g. anji kaal —+-— anjee kaal ‘SZ° 
pattu mukkaal -+—pattee mukkaal 102 


but no connector is required with, are: ‘3’ 
e.g. ettu + are —+— ettare ‘8i" 
This deletion of Q can be stated as a rule 
4. Q --— @/— Z 
The numbers in the set of tens, 20-90, which have a final-du, 
replace this-du by the connector-ttu when the unit numbers such 
as onnu, rendu (1-9) are added. The final —u (of -ttu) is further 
elided by the general sandhi rules. Thus, 
5. X¥+Q+I1 —> —> X + ttu+l 
nappadu + Q + aaru —>—> naappattaru (46) 
aimbadu + Q + readu—»>—~aimbatterendu (52) 
Another suggestion regarding Q, is that morphophonemic rule 
regarding Q may be eliminted if Q is given subscripts just as 
D and ¥. ‘ihe maximum expansion is: 


te’ pt Nay % oe 
4% Yaa: 


In this case, the lexical features would be: 


ce = Os 
tu = Qs, Q; 
ttu=Qo 


The revised rules would be, 
2, eons D4 Oy] Rae 
3; Roe ¥q.. Qo Ye [Qs v3] 
The Tamil rules differ from the Gujarati rules in the following 
way: 
i) Absence of such sequences as ‘eleven hundred, fifteen 
hundred’ etc. in Tamil obviates the need ofa ¥, wy in 
the base rules of Tamil. 
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ii) The fractions always come at the end in Tamil numerals. 
Moreover they are always additive unlike Gujarati where 2 
in a ¥, ¥3 i.e. pona car is interpreted as ‘one quarter less from 
four: 33. The complicated rules regarding fractions are not 
required for Tamil. 


iii) Tamil numbers such as twenty one, thirty five etc. (11-99) can 
be generated from the base rules without any T rule - unlike 
Gujarati where a T rule XXI —-— IXX is required for ekvis 
(lit. ‘one-twenty’) etc. (see page 10); Tamil numerals (21 - 99) 
are ‘twenty and -one’ type. 


iv) Tamil requires a connector Q in the base rules and morpho- 
phonemic rules for the selection of the connector; Gujarati 
does not require a connector. 


We have not formulated two rules for Tamil regarding the set 
of numbers 10, 20..... 90, which can be interpreted as 10, 
2x 10,3x10...9x 10; the forms pattu:10; irupadu: (20) 
etc. can be interpreted as 2 x lU etc. Another morphophonemic 
rule required for Tamil is to state the formation of tolaayram 
‘nine hundred’ which can be interpreted as ‘one hundred less 
than one thousand; aayram is ‘one thousand’ in Tamil 
(Radhakrishnan 1963: 110-22). 


The Saurashtri rules for number names, given below indicate 
that Saurashtri shares with Tamil all the above four points of 
difference between Tamil and Gujarati. Saurashtri and Tamil 
have, therefore, identical grammar of number names, though 
the lexicon of Saurashtri is mainly IA. 


SAURASHTRI 


Non-terminal vocabulary : 
Ro Ry Ry 
C D1 D? 


Yi Yo Ys 
Lo ee XX (1) 
Q 


Terminal vocabulary : 


I : onte, diye thinye, caarye, paacye, sovye, 
saatye, aathye, novye (1-9) 


Mm  OEsye,; 
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XI : vigaarye, baarye, terye, caudye, phandarye, sovalye, 
satarye, tiviniviis, oviniviis (also egun viis) (11-19). 
X (I) :  viis, viis un onte, viis diye,... 
viis un novye (20, 21, 22.. 29) 
triis, triis un onte... (30,31... ), 
Similarly, caalisye (40), pannaas (50), saat (60), 
sattar (70), assi (80), desunno onte so (90). 


(20-99). 

CS 85 (100) 

D1: sasar (100,000) 

D2: leku (100,000) 

Production rules: 
zero 
1. Integer — fe ¥3 
3 
R, 

2 Ri  —- (¥%3 Dt Q) R,-1 
Re ap (Ys D2 Q) Ry 
Ry + (3 D° Q) Ro 
Ro —+ (¥; Q ve (Q ¥3) 

ay 


| Xx (I) 
fija | 
5. 3 > < 1/2 


Morphophonemic rules: 


1) XX+Q+1 —+— XX un I 
€. g. viiS+onte —+-—Viis un onte ‘21’ 
triis +onte ——triis un onte ‘31’ 


Be: re 
ii) {Dt $+Q+R, ++ 4 Di b +holle+R, 
lp: J Lp: J 
@.g. Onte so onte —+—onte so holle onte ‘101’ 
onte so kaal -+—+-onte so holle kaal ‘1002’ 


Onte sasar onte so -—+—onte sasar holle onte so ‘1100’ 
onte sasar wonnukkaal so+— onte sasar holle 


wonnukkaal so ‘11004’ 
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The above selection rules for the connector are comparable 
to similar rules for Tamil, though the lexical items (connectors) 
and the restrictions are different. There is one more difference: if 
the first number is thousand, the following connector could be 
either holle or un e.g. onte aasar onte —-— onte sasar un onte or 

Onte sasar holle onte ‘1001’; 


Tamil does not permit such variation. 
The highest digit for Saurashtri is sasar ‘thousand’; while the 


highest for Tamil is kooti. Note that the digits above thousand in 
Tamil, lecc6 and koodi, are loanwords from Indo-Aryan. 


The four characteristics in which Tamil differed from Gujarati 
are shared by Saurashtri with Tamil. Thus, in all the following 
respects, Saurashtri agrees with Tamil: 


(i) Saurashtri does not permit sequences such as ‘eleven 


hundred’ (as alternating with ‘one thousand and one 
hundred’); 


(ii) its fractions come at the end and they are additive; 


(iii) the numbers 21-99 are of the ‘twenty - and - one’ 
type and 

(iv) it requires a connector Q. 

Saurashtri lexical items for the fractions + and 3 areloan- 


4 
words from Tamil; while the item for 4 is Indo-Aryan. 
va kaal 
1 hoddu 
= mukkaal 


There are some more morphophonemic rules for Saurashtri 

fractions which may be stated here: 
onte hoddu —»~——~ dedh (12) 
(cf. Gujarati dodh p. 9) 

The numerals 125, 150 and 175 in Saurashtri can be stated 
by the alternating forms, e. g. 

onti so holle vis un paac or wonnukkal so 

Onti so holle pannaas or dedh so 

onti so holle sattar un paac or wonnumukkal so; 
but unlike Gujarati, these alternating forms are permissible only 
when the highest digit is hundred e.g. upto 900. ‘one thousand five 
hundred’ cannot be translated in Saurashtri by ‘one and half 
thousand’ as one does in Gujarati ‘dodh hojar’,; the permissible 
Saurashtri form is only ‘onti sasar holle paan so’ 
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The most peculiar feature of the Saurashtri numerals, which 
is neither found in Gujarati nor Tamil (nor, indeed in any 
number system in Indian languages) is the count from 90 to 99. 
The Saurashtri numeral for 20 is 

des unno onte so ‘ten less from one hundred’ 


Then the count proceeds on the same lines upto 99 in the same 
fashion, 


91. : nov unno onte so : nine less from one hundred 
92. > 7 taaltye unno onne S) \. Sipiipeae 3. G, fi 
93,-:-s4)saatyeunno Onte So. "Seven 0... 
94. : soye unno onte so etc, 8 i 53 
95. <3: paaecye nnd onte’so.~ > Aves ee ., = 
96. : carye unno onte so : hour eaeds ys 5, te 
97, = ove nnd onte.so. . 3 Reet 3 a 
98. : diye unno onte so  AWO nee ee is 
99. : onte unno onte so SONG) gees 5 s 


A holistic approach to language may not explain the 
Saurashtri - Tamil type of bilingualism. The Saurashti-historically 
an IA language - may have its syntactic structure closely akin to 
Tamil; large numbers of utterances of both the languages can be 
generated by the same set of meta-rules. The grammar of number 
names is one example of such set ofrules. Similarly the phonolo- 
gical rules may also be identical, while the lexical selection and 
morphophonemic rules may be different for the two languages. 
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4. Gujarati and Tamil are presented in conventional phonemic 
transcription. Saurashtri has also been transcribed in conven- 
tional phonemic transcription; but it should be noted that the 
bilingual’s phonology presents some problems e.g. minimal pairs 
are few, incidence of ‘free variation’ is much greater in Saurashtri 
e.g. medially, voiceless and voiced stops vary freely in many 
cases; so does vowel length. These ‘free variations’ may not be 
‘free’ if viewed in a sociolinguistic framework. 
Here, we have transcribed all data in conventional phonemic 
transcription, 

5. For adifferent model of generative grammar of Tamil number 
names, see S. Agesthialingom ‘A Generative Grammar of Tamil’ 
Annamalai University, 1967. pp. 55-73. 


6. R. Radhakrishnan ‘Tamil Numerals’ Grammar’ Indian 
Linguistics 24 (1963) pp. 110-122. 


NOTES 

1. Earlier work in Saurashtri: Randle 1943-46, Ramasubramanian 
1968. 

2. John Gumberz has introduced the notion of repertoire and 
discussed it in his papers (1966, 1967). 

3. Gujarati and Tamil are presented in conventional phonemic 
transcription. Saurashtri has also been transcribed in conven- 
tional phonemic transcription; but it should be noted that the 
bilingual’s phonology presents some problems e.g. minimal 
pairs are few, incidence of ‘free variation’ is much greater in 
Saurashtri e.g. medially, voiceless and voiced stops vary 
freely in many cases: so does vowel length. These ‘free vari- 
ations’ may not be ‘free’ if viewed in a sociolinguistic frame- 
work. 

Here, we have transcribed all data in conventional phonemic 
transcription, 


Due to want of space the next number will carry as continuation 

the following items: Ed: 

Franklin C. Southworth, Linguistic stratigraphy of north India 

Helen E. Ullrich-Baylis, Morphological coexistence: A key to 
linguistic convergence 


General bibliography 
Note on contributors 


ijdl Vol. UI No. 1 


198 


PUBLICATIONS RECEIVED 


R. L. Trail 


P. Somasekharan Nair 


Jawaharlal Handoo 


Ja Devg Singh 


Indira Manuel 


P. Doctor Naciimtiin 


Bharathi Tamil Sangham 


A. Rajendran 


P. Keesava Deev 


H. P. Mckaughan and B. A. 


Macaraya 


S. M. L. Lekshmanan Chettiar 


: Patterns in Clause, Sentence and 
Discourse— in selected languages of 
India and Nepal, Part I Sentence 
and Discourse. Part II Clause; 
Part III Texts; Part IV Word Lists. 
S. L. L. Publication No. 41 Univer- 
sity of Oklahoma, Norman, U.S.A., 
1972. 


: ‘Bhaashaa Bheeda Vignaanam’ 
(Malayalam) N. B. S., Kottayam, 
1973 Rs. 4. 


: Kashmiri Phonetic Reader C. I. 
I.L., Mysore 6, Phonetic Reader 
Series 8, 1973 Rs. 8. 


: A Descriptive Grammar of Ban- 
gru, Kurukshetra University, Kuru- 
kshetra, Hariyana, India 1970. 


: Paalai paatiya Perunkatunkoo 
(Tamil) Jayakumari Stores, 
Nagercoil, India, 1973, Rs. 4. 


: Paaratitaacan paatalkalil iyaR- 
kai (Tamil) Meenakshi Puttaka 
Nilayam, Madurai, 1973, Rs. 4. 


: Essays on Kamban, Calcutta, 
1973 Reo, 


: Ceelam maavattak  kooyilkal 
(Tamil) Jayakumari Stores Nager- 
coil, 1972 Rs. 4.50 


: ‘nooval noovalistiinre kaalccap- 
paatil’ (Malayalam) Kerala Uni- 
versity, Trivandrum 1973. Rs. 2.50 


: A Maranao Dictionary, Univer- 
sity of Hawaii Press, Honolulu. 
U.S; AsSieGy, 


: Folklore of Tamil Nadu, National 
Book Trust of India, New Delhi, 
1973, Rs. 8. 


ijdl Vol. Ill No. 1 199 


M. E. Manikavasagom Ed : : Prof. A.S. Narayanaswami Naidu 
Commemoration Volume, G. V.N. 
College, Koilpatti, Tamil Nadu 
1973 (RSs. 2o 


C. K. Karim ‘Kerala Under Haider Ali andTippu 
Sultan, Paico Publishing House, 
Cochin, 1973 Rs. 40. 


M. Isreal : The Treatment of Morphology in 
Tolkaappiyam, Madurai University 
Central Co-operative Stores Ltd. 
Madurai, 1973, Rs. 7. 


N. Ravindran Ed: : Journal of Kerala Studies, Vol. I 
No. 1, University of Kerala 
Trivandrum India, 1972. 


L. Gloria Sundramathi Canka Ilakkiya - Marutam-Tamil 
Department, Kerala University, 
Trivandrum 1973 Rs. 7.50. 


Exchange 
Language Vol. 49 Numbers 1, 2, 3 and 4, 1973. 


International Journal of Dravidian Linguistics 


PUBLICATIONS OF THE D.L.A. 
(Available at half of the published price for Life members) 


}3, 


Price 
Vowel Duration in Malayalam-An Acoustic 
Phonetic Study S. Velayudhan, Ks--4,00:0.5.$ 1° 00 
Telugu—An Intensive Course, 
N. Sivaramamurti Be, 2250-0). S.5.-01 . 30 
Souvenir of the First All India Conference 
of Dravidian Linguists Rs. 4.00 U. S.$ 1 00 


Malayalam verbal forms 

V.R. Prabodhachandran 
Intensive Course in Malayalam, 
A. P. Andrewskutty Rs. 6.00°U. S. 3 1 
Bilingualism—a bibiliography 

R. Solomon Rs. 4.00 U.S.$ 1 
The International Journal of Dravidian 

Linguistics [the Semi] Official Bulletin 

of the DLA (Free to members) ] Yearly 

Subscription Keyes. So. 5 5 
Proceedings of the First Conference 

of Dravidian Linguists, Asst. Ed. E. Valentine 

Ed. V. I. Subramoniam Rs. 40.00 U. S.$ 10 
Language of Middle Malayalam 

P. Ramachandran Pillai Resoueie. ©. 12 
Muslim Dialect of Malayalam 

G. K. Panikkar Rs. 30.00 10 
Debates on the Grammatical Theories in 

Malayalam Ed. V. I. Subramoniam Rs. 6 00 1 
Practical Grammar of Tamil by 

Beythan, Translated from German to 


Rs. 30.00 U.S.$ 8 


00 


75 


ee 


00 


00° 


00 


00 


50 


English by Wulfrom Gude in press 


Seminar on Dialectology 
Ed. V. I. Subramoniam, Asst. Eds. 
P. Somasekharan Nair and B.Gopinathan Nair Rs.12.50 3 


D0 


. 


Statement about ownership and other particulars about the ijdl 
Dept. of Linguistics University of Kerala 
FORM IV 
(See Rule 8) 


Place of Publication Trivandrum 

Periodicity of its Publication: Biannul 

Printer’s Name: V. 1. Subramoniam 

Nationality : Indian 

Whether citizen of India: Yes 

(If foreigner, state the Country 

of origin): 

Address : V. I. Subramoniam, Professor and Head of the 
Department of Linguistics, University of Kerala, 

Publisher’s Name: do. Trivandrum. 

(Whether citizen of India ?): Yes 

(If foreigner, state the Country of origin): 

Address - Department of Linguistics, University of Kerala, 

Trivandrum, 


Name and address of individuals 

who own the newspaper and partners 

or shareholders holding more than 

ene percent of the total capital: Professor of Linguistics, University of Kerala 
Trivandrum 


I, V.1I.Subramoniam hereby declare that the particulars given above 


are true to the best of my knowledge and belief. 


Trivandrum, V. I. SUBRAMONIAM: (Sd.) 
1-1-1974. Signature of Publisher. 


We regret to record with anguish the premature 
death of S. Rudin (Leningrad), member of the Board 
of Editors of the ijd/ in a boat accident. Further 
details are awaited from M. Andronov. Rudin was 
an impressive teacher of many Indian languages. His 
reputation as a researcher is considerable. His 
humility is only matched by his devotion to scholar- 


ship: 
Pditor. 


> 


DRAVIDIAN LINGUISTICS ASSOCIATION : 
OFFICE BEARERS 1973-74 


President : 
Surnad P. N Kunjan Pillai (Trivandrum) 
Vice-Presidents : 
M. M. Bhat (Bangalore) 
S. Bhattacharya. (Calcutta) 
Bh. Krishnamurthi (Hyderabad) 
Secretaries : 
| R. C. Hiremath (Dharwar) 
K. Mahadeva Sastri (Anantapur) 
V. 1. Subramoniam (Trivandrum) 
Committee for Tribal Languages : 
Francis Ekka 
Hans Stresser 
Working Committee Members : 
_H. M. Nayak (Mysore) | 
Puthusseri Ramachandran (Trivandrum) 
C. R_ Sankaran (Poona) 
M. Shanmugham Pillai (Madurai) 
_ G.N. Reddy (Tirupati) 
_K.K. Raja (Madras) 
President, Linguistic Society of India (Ex-Officio) 
: G. K. Panikkar (Trivandrum) - Treasurer 
Representatives in Foreign Countries : 
M. S. Andronov (U..S. S. R.) 
~R.E. Asher (U. K.) 
~R. Sadasivam (Ceylon) 
=~. Schiffmann (U.S. A.) 
Kamil Zvelebil (Heidelberg) 
Fellows : ; . 
Senior Fellows A. Veluppillai and K. N, Ezhutaccan 
Junior Fellows G. V. Natarajan and N. Rajendran 
Membership : 7 
. (a) Life membership (Lumpsum Rs. 200/- or U, S. $ 50.00: In - 
ten equal & successive instalments to begin from 30th 
July and end in April, Rs. 220/- or U. S. $ 60.00) 
(b) Institutional membership (Lumpsum Rs. 400/- or U. S. 
F00,00) >. . 
(c) Ordinary membership (Rs. 20/- or U. S. $ 5.00) 
TOsGentict +5 
THE TREASURER, Dravidian Linguistic Association, 
Department of Linguistics, University of Kerala, 
Kariyavattom P. O., Trivandrum, S. India. 


BOOK. REVIEWS 


/ 


Two copies of the books to be reviewed should be sent. 
Twentyfive offprints will be given to the reviewer and ten to ‘the 
publisher. 


SUBSCRIPTION 


_The annual subscription from 1st January to 31st December 
each: year is Rs. 25/- for India. U.S. $5.00 for countries other 
than India. Single number Rs. 15/- or U.S. $ 3.00 each. 


~The members of the Dravidian Linguistic Association are © 


Phe] 


Hy 


eligible to get a free copy each from the Treasurer of the D.L. AOA 


Accredited Book sellers are eligible to get 25% commission he 


for the Journal. 


ADVERTISEMENT fe ge a 


Only from publishers and academic institutions advertise- 


ments are accepted. One insertion, full page Rs. 300/- or U. er 
$ 60.00. For six insertions, full page Rs. 250/- or U.S. $ 50.00 each. 


bg 


CORRESPONDENCE 4 +. 


All matters relating to subscription, sales, advertisement and ae 
change of address of the subscribers, manuscripts, and books for 
review should be addressed to the Editor, ijdl., Department of 
Linguistics, University a Kerala, Kariyavattom P. 0. Trivandrum, Bes 
S. India. 


; se Please note that cheques drawn may be in favour ae ‘the ee 
Registrar, University of Kerala, but sent to the Journal's Office. © | a 


* 
a Ae 


: Edited | hans published by V.I. Subramoniam for and on behalf of the © 


Department of Linguistics, University of Kerala, Kariyavattom P. O. Trivandrum and 
printed at the University Co-operative Stores Press, Trivandrum. Copyright with the 
University of Kerala, Trivandrum. 


the ah) 4 
5 


eS ’ 


